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Abstract 



We provide a reduction from revenue maximization to welfare maximization in multi-dimensional 
Bayesian auctions with arbitrary (possibly combinatorial) feasibility constraints and indepen- 
dent bidders with arbitrary (possibly combinatorial) demand constraints, appropriately extend- 
ing Myerson's single-dimensional result [23] to this setting. We also show that every feasible 
Bayesian auction can be implemented as a distribution over virtual VCG allocation rules. A 
virtual VCG allocation rule has the following simple form: Every bidder's type ti is transformed 
into a virtual type fi(ti), via a bidder-specific function. Then, the allocation maximizing virtual 
welfare is chosen. Using this characterization, we show how to find and run the revenue-optimal 
auction given only black box access to an implementation of the VCG allocation rule. We gener- 
alize this result to arbitrarily correlated bidders, introducing the notion of a second-order VCG 
allocation rule. 

We obtain our reduction from revenue to welfare optimization via two algorithmic results on 
reduced form auctions in settings with arbitrary feasibility and demand constraints. First, we 
provide a separation oracle for determining feasibility of a reduced form auction. Second, we 
provide a geometric algorithm to decompose any feasible reduced form into a distribution over 
virtual VCG allocation rules. In addition, we show how to execute both algorithms given only 
black box access to an implementation of the VCG allocation rule. 

Our results are computationally efficient for all multi-dimensional settings where the bidders 
are additive (or can be efficiently mapped to be additive). In this case, our mechanisms run 
in time polynomial in the number of items and the total number of bidder types, but not 
type profiles. This running time is polynomial in the number of items, the number of bidders, 
and the cardinality of the support of each bidder's value distribution. For generic correlated 
distributions, this is the natural description complexity of the problem. The running time can 
be further improved to polynomial in only the number of items and the number of bidders in 
item-symmetric settings by making use of techniques from |15j . 
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1 Introduction 



The multi- dimensional mechanism design problem has received much attention from the economics 
community, and recently the computer science community as well. The problem description is 
simple: a seller has a limited supply of several heterogenous items for sale and many interested 
buyers. The goal is for the seller to design an auction for the buyers to play that will maximize 
her revenue. In order to make this problem tractable (not just computationally, but at all), some 
assumptions must be made. First, we assume that the seller has some bayesian prior D on the types 
of buyers that will show up to the auction. Second, we assume that the buyers have the same prior 
as the seller, and that they will play any auction at a Bayes-Nash Equilibrium. We also assume that 
all buyers are quasi-linear and risk-neutral, terms that are defined formally in Section [2j Finally, 
we say that the goal of the seller is to maximize her expected revenue over all auctions when played 
at a Bayes-Nash Equilibrium. All of these assumptions have become standard with regards to this 
problem. Indeed, all were made in Myerson's seminal paper on revenue-maximizing mechanism 
design where this problem is solved for a single item and product distributions |24j . In addition, 
Myerson introduces the revelation principle, showing that every auction played at a Bayes-Nash 
Equilibrium is strategically equivalent to a Bayesian Incentive Compatible (BIC) direct revelation 
mechanism. In a direct revelation mechanism, each bidder reports a bid for each possible subset 
of items they may receive. A direct revelation mechanism is called BIC if it is a Bayes-Nash 
Equilibrium for each bidder to bid exactly their value for each subset. In essence, Myerson's 
revelation principle says that one only need to consider BIC direct revelation mechanisms rather 
than arbitrary auctions played at a Bayes-Nash Equilibrium to maximize revenue (or any other 
objective for that matter). 

As we depart from Myerson's single-item setting, the issue of feasibility arises. With only a 
single item for sale, it is clear that the right feasibility constraints are simply that the item is 
always awarded to at most a single bidder. With many heterogenous items, there are many natural 
scenarios that we would like to model. Here are some examples: 

1. Maybe the items are houses. In this case, a feasible allocation awards each house to at most 
one bidder, and to each bidder at most one house. 

2. Maybe the items are appointment slots with doctors. In this case, a feasible allocation does 
not award the same slot to more than one bidder, and does not award a bidder more than 
one slot with the same doctor, or overlapping slots with different doctors. 

3. Maybe the items are bridges built at different locations. In this case, a feasible allocation 
awards each bridge to everyone or to no one. 

Sometimes, feasibility constraints are imposed by the supply side of the problem: a doctor 
cannot meet with two patients at once, and a bridge cannot be built for one bidder but not 
another. Other times, feasibility constraints are imposed by the demand side of the problem: no 
bidder wants two houses or two appointments with the same doctor. Without differentiating where 
feasibility constraints come from, we model them in the following way: let A = [m] x [re] denote 
the space of assignments (where denotes that bidder i is assigned item j), and let J 7 be a set 
system on A (that is, a subset of 2 A ) . Then in a setting with feasibility constraints J- , it is possible 
for the seller to simultaneously make any subset of assignments in T . T may be a truly arbitrary 
set system, it need not even be downward-closed. 

As we leave the single-dimensional setting, we also need to consider how a bidder values a 
bundle of multiple items. In general, a bidder may have arbitrarily complicated ways of evaluating 
bundles of items, and this information is encoded into the bidder's type. For the problem to be 
computationally meaningful, however, one would want to either assume that the auctioneer only 
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has oracle access to a bidder's valuation, or impose some structure on the bidders' valuations allow- 
ing them to be succinctly described. Indeed, virtually every recent result in revenue-maximizing 
literature [21 SI El El El [101 [XXI [15], [20] assumes that bidders are capacitated- additive^ In fact, most 
results are for unit-demand bidders. It is easy to see that, if we are allowed to incorporate arbi- 
trary demand constraints into the definition of such bidders can be described in our model as 
simply additive. In fact, far more complex bidders can be modeled as well, as demand constraints 
could instead be some arbitrary set system. Because J- is already an arbitrary set system, we may 
model bidders as simply additive and still capture virtually every bidder model studied in recent 
results, and more general ones as well. In fact, we note that every multi-dimensional setting can be 
mapped to an additive one, albeit not necessarily computationally efficiently!! So while we focus 
our discussion to additive bidders throughout this paper, our results apply to every auction setting, 
without need for any additivity assumption. In particular, our characterization result (Informal 
Theorem [2]) of feasible allocation rules holds for any multi-dimensional setting, and our reduction 
from revenue to welfare optimization (Informal Theorem [1]) also holds for any setting, and we show 
that it can be carried out computationally efficiently for any additive setting. 



Optimal Multi-dimensional Mechanism Design. With the above motivation in mind, we 
formally state the revenue optimization problem we solve. We remark that virtually every known 
result in the multi-dimensional mechanism design literature (see references above) tackles a special 
case of this problem, possibly with budget constraints on the bidders (which can be easily incor- 
porated in all results presented in this paper as discussed in Appendix [H]) , and possibly replacing 
BIC with IC. We explicitly assume in the definition of the problem that the bidders are additive, 
recalling that this is not a restriction if computational considerations are not in place. 

Revenue-Maximizing Multi-Dimensional Mechanism Design Problem (MDMDP): 

Given as input m distributions (possibly correlated across items) T>i, . . . , T> m over valuation vec- 
tors for n heterogenous items and feasibility constraints J-, output a BIC mechanism M whose 
allocation is in T with probability 1 and whose expected revenue is optimal relative to any other, 
possibly randomized, BIC mechanism when played by m additive bidders whose valuation vectors 
are sampled from T> = ^{Di. 

We provide a poly-time black box reduction from the MDMDP with feasibility constraints J- 
to implementing VCG with feasibility constraints T by introducing the notion of a virtual VCG 
allocation rule. A virtual VCG allocation rule is defined by a collection of functions fi for each 
bidder i. fi takes as input bidder i's reported bid vector and outputs a virtual bid vector. When 
the reported bid vectors are v\,... ,v m , the virtual VCG allocation rule with functions {/i}j 6 r m i 
simply implements the VCG allocation rule (with feasibility constraints J 7 ) on the virtual bid 
vectors . . . , f m (v m ). We also note here that implementing VCG for additive bidders is in 

general much easier than implementing VCG for arbitrary bidders]! Our solution to the MDMDP 
is informally stated below, and is formally given as Theorem [6] of Section [6) 

1 A bidder is capacitated-additive if for some constant C her value for any subset S of at most C goods is equal to 
the sum of her values for each item in S, and her value for any subset S of more than C goods is equal to her value 
for her favorite S' C S of at most C goods. 

2 The generic transformation is to introduce a meta-item for every possible subset of the items, and have the 
feasibility constraints (which are allowed to be arbitrary) be such that an allocation is feasible if and only if each 
bidder receives at most one meta-item, and the corresponding allocation of real items (via replacing each meta-item 
with the subset of real items it represents) is feasible in the original setting. Many non-additive settings allow much 
more computationally efficient transformations than the generic one. 

3 When bidders are additive, implementing VCG is simply solving the following problem, which is very well 
understood for a large class of feasibility constraints: every element of A has a weight. The weight of any subset of 
A is equal to the sum of the weights of its elements. Find the max-weight subset of A that is in T . 
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Informal Theorem 1. Let Ajr be an implementation of the VCG allocation rule with respect to 
T (i.e. Ajr takes as input a profile of bid vectors and outputs the VCG allocation). Then for 
all T>i, . . . ,V m with finite support and all T , given T>\, . . . ,V m and black box access to Ajr (and 
without need of knowledge of T), there exists a fully polynomial-time randomized approximation 
schemgfor the MDMDP whose runtime is polynomial in n, the number of bidder types (and not 
type profiles), and the runtime of Ajr. Furthermore, the allocation rule of the output mechanism is 
a distribution over virtual VCG allocation rules. 

We remark that the functions defining a virtual VCG allocation rule may map a bidder type to 
a vector with negative coordinates. Therefore, our given implementation of the VCG allocation rule 
should be able to handle negative weights. This is not a restriction for arbitrary downwards-closed 
J- as any implementation of VCG that works for non-negative weights can easily be (in a black-box 
way) converted into an implementation of VCG allowing arbitary (possibly negative) inputs But 
this is not necessarily true for non downwards-closed J-~'s. If the given Ajr cannot accommodate 
negative weights, we need to replace it with an algorithm that can in order for our results to be 
applicable. 

Several extensions are stated and discussed in Section [6j including solutions for distributions 
of infinite support, and improved runtimes in certain cases that make use of techniques from [15], 
We also extend all our solutions to accommodate strong budget constraints by the bidders in 
Appendix HH So how does our solution compare to Myerson's single-dimensional result? One 
interpretation of Myerson's optimal auction is the following: First, he shows that the allocation 
rule used by the optimal auction is just the Vickrey allocation rule, but on virtual bids instead of 
true bids. Second, he provides a closed form for each virtual transformation using (ironed) virtual 
values. And finally, he provides a closed form pricing rule that makes the entire mechanism BIC 
(in fact, IC). In the multi-dimensional setting, it is known that randomness is necessary to achieve 
optimal revenue, even with a single bidder and two items [6j [12] , so we cannot possibly hope for a 
solution as clean as Myerson's. However, we have come quite close in a very general setting: Our 
allocation rule is just a distribution over virtual VCG allocation rules. And instead of a closed 
form for each virtual transformation and the pricing rule, we provide a computationally efficient 
algorithm to find them. 

Characterization of Feasible Interim Allocation Rules. In addition to our solution of the 
MDMDP, we provide a characterization of feasible interim allocation rules of multi-dimensional 
mechanisms in all (not necessarily additive) settings!! We show the following informal theorem, 
which is stated formally as Theorem [1] in Section [3l Recall that a virtual VCG allocation rule is 
associated with a collection of functions fa that map types t{ to virtual types fa(U) for each bidder 
i, and allocates the items as follows: for a given type vector (ii, t m ), the bidders' types are 
transformed into virtual types (/i(ti), . . . , / m (t m )); then the virtual welfare optimizing allocation 
is chosen. 

Informal Theorem 2. Let J- be any set system of feasibility constraints, andT> any (possibly cor- 
related) distribution over bidder types. Then the interim allocation rule of any feasible mechanism 
can be implemented as the interim rule of a distribution over virtual VCG allocation rules. 

4 This is often abbreviated as FPRAS, and we provide its formal definition in Section [21 

5 The following simple black-box transformation achieves this: first zero-out all negative coordinates in the input 
vectors; then run VCG; in the VCG allocation, un-allocate item j from bidder i if the corresponding coordinate is 
negative; this is still a feasible allocation as the setting is downwards-closed. 

6 For non-additive settings, the characterization is more usable for the purposes of mechanism design when applied 
to meta- items (see discussion above), although it still holds when directly applied to items as well. 
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1.1 Related Work 



1.1.1 Structural Results 

Some structural results are already known for special cases of the MDMDP and its extension to 
correlated bidders. As we have already discussed, Myerson showed that the revenue-optimal auction 
for selling a single item is a virtual Vickrey auction: bids are transformed to virtual bids, and the 
item is awarded to the bidder with the highest non- negative virtual value [23]. It was later shown 
that this approach also applies to all single-dimensional settings (i.e. when bidders can't tell the 
difference between different houses, appointment slots, bridges, etc) as long as bidders' values are 
independent. In this setting, bids are transformed to virtual bids (via Myerson's transformation), 
and the virtual-welfare-maximizing feasible allocation is chosen. These structural results are indeed 
strong, but hold only in the single-dimensional setting and are therefore of very limited applicability. 

On the multi-dimensional front, it was recently shown that similar structure exists in restricted 
settings. It is shown in [8] that when selling multiple heterogenous items to additive bidders with 
no demand constraints (i.e. J- only ensures that each item is awarded to at most one bidder), 
the optimal auction randomly maps bids to virtual bids (according to some function that depends 
on the distributions from which bidders' values are drawn), then separately allocates each item 
to the highest virtual bidder It is shown in [2] that when there are many copies of the same 
customizable item and a matroid constraint on which bidders can simultaneously receive an item 
(i.e. J- only ensures that at most k items are awarded, subject to a matroid constraint on the served 
bidders), that the optimal auction randomly maps bids to virtual bids (according to some function 
that depends on the distributions from which bidders' values are drawn), then allocates the items 
to maximize virtual surplus (and customizes them after). We emphasize that both results, while 
quite strong for their corresponding settings, are extemely limited in the settings where they can 
be applied. In particular, neither says anything about the simple setting of selling houses to unit- 
demand bidders (i.e. T ensures that each house is awarded at most once and each bidder receives 
at most one house: Example 1, Section [I]). Selling houses to unit-demand bidders is on the easy 
side of the settings considered in this paper, as we provide a solution in multi-dimensional settings 
with arbitrary feasibility constraints. We do not even assume that J- is downward-closed. 

For correlated bidders, the series of results by Cremer and McLean |13} H3] and McAfee and 
Reny |23] solve for arbitrary feasibility constraints subject to a non-degeneracy condition on the 
bidder correlation (that is not met when bidders are independent). Under this assumption, they 
show that the optimal auction extracts full surplus (i.e. has expected revenue equal to expected 
welfare) and simply uses the VCG allocation rule (the prices charged are not the VCG prices, 
but a specially designed pricing menu based on the bidder correlation). Our structural results for 
correlated bidders apply to arbitrary feasibility constraints as well as arbitrary bidder correlation, 
removing the non-degeneracy assumption. Of course, the expected revenue extracted by our mech- 
anisms cannot possibly always be as high as the expected maximum social welfare (as it happens 
in Cremer-McLean and McAfee- Reny) as they also apply to independent bidders, but our charac- 
terization is still quite simple: the optimal auction randomly maps pairs of actual bids and possible 
alternative bids to second-order bids. Then, the second-order bids are combined (based on the 
underlying bidder correlation) to form virtual bids, and the virtual- welfare-maximizing allocation 
is chosen. 

7 In fact, the allocation rule of [8] has even stronger structure in that each item is independently allocated to the 
bidder whose virtual value for that item is the highest, and moreover the random mapping defining virtual values for 
each item simply irons a total ordering of all bidder types that depends on the underlying distribution. 
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1.1.2 Algorithmic Results 

The computer science community has contributed computationally efficient solutions to special 
cases of the MDMDP in recent years. Many are constant factor approximations [H HI [TO] QTJ 
120] , These results cover settings where the bidders are unit-demand (or capacitated-additive) and 
the seller has matroid or matroid-intersection constraints on which bidders can simultaneously 
receive which items. All these settings are special cases of the MDMDP framework solved in this 
paperjf) In even more restricted cases near-optimal solutions have already been provided. Tools 
are developed in [3 El [15] that yield solutions for simple cases with one or few bidders. Cases with 
many asymmetric independent bidders are considered in [8] and [2J. As discussed above, in [8], the 
case where J- ensures that each item is awarded at most once is solved. In [2], the case where J- 
ensures that at most k items are awarded, subject to a matroid constraint on the served bidders 
is solved. Our computational results push far beyond existing results, providing a computationally 
efficient solution in multi-dimensional settings with arbitrary feasibility constraints. 

1.1.3 Additional Discussion on Multi-Dimensionality 

In light of similarities between our results and those of [21 [8], we provide a short discussion to 
properly compare the settings where each result applies. We begin with what makes a setting 
single- or multi-dimensional. In a setting with multiple items, two aspects come into play. First, 
there are bidder preferences. Bidder preferences are single-dimensional if bidders cannot tell the 
difference between different items. Specifically, each bidder can be completely described by a single 
value, v, and their value for a feasible allocation that awards them k items is k ■ v. Next, there 
are feasibility constraints. Feasibility constraints are single-dimensional if they only enforce which 
bidders can simultaneously receive how many items, but not which items they receive. Formally, 
this means T is such that if an allocation {(«i, ji), . . . , (ik,jk)} is feasible, so is {(ii, ■ ■ ■ , (ik, j'k)} 
for all (not necessarily distinct) items j[, . . . ,j' k . 

The results of [21 [8] solve the MDMDP for very special types of feasibility constraints. Specif- 
ically, [8] only covers the following setting: the seller has a single copy of each of n heterogenous 
items. Bidders have multi-dimensional preferences and are additive with no demand constraints. 
In [2], the feasibililty constraints are single-dimensional and a matroid, and bidders have multi- 
dimensional preferences and are unit-demand. In this paper, the feasibility constraints may be 
arbitrary and multi-dimensional. Below are some specific examples to emphasize the differences. 
We again remark that the examples below are on the "easy" side of settings covered in this paper, 
as our results allow T to be arbitrary. 

1. There is one copy of a single painting for sale. Then both the feasibility constraints and 
bidder preferences are single-dimensional. This is solved by Myerson |24j . Alaei et. al. [2J, 
Cai et al. [8], and this paper. 

2. There is one copy of each of several (different) paintings for sale, bidders are additive with 
no demand constraints. Then both the feasibility constraints and bidder preferences are 
multi-dimensional. This is solved by Cai et al. [8] and this paper. 

3. There are several copies of the same painting for sale, a matroid constraint on which bid- 
ders may simultaneously receive the painting, and bidders are unit-demand. Then both the 
feasibility constraints and bidder preferences are single-dimensional. This is solved by Myer- 
son |24j . Alaei et al. [2], and this paper. 

8 Again, in some of these results [UHj bidders may also have budget constraints, which can be easily incorporated 
to the MDMDP framework without any loss, as is shown in Appendix[H] and some replace BIC with IC [II 1 1011111120] . 
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4. There are several copies of the same car for sale, and a matroid constraint on which bidders 
may simultaneously receive a car. The seller can freely customize each car in several different 
ways, and bidders are unit-demand. Then the feasibility constraints are single-dimensional, 
and bidder preferences are multi-dimensional|£| This is solved by Alaei et al. [2] and this 
paper. 

5. There are several houses for sale to unit-demand bidders (Example 1, Section [T]). Then both 
the feasibility constraints and bidder preferences are multi-dimensional. This is previously 
unsolved, and solved in this paper. 

6. There are appointment slots with several doctors available (Example 2, Section [TJ. Then both 
the feasibility constraints and bidder preferences are multi-dimensional. This is previously 
unsolved, and solved in this paper. 

In view of the above discussion, the present paper is the first to obtain solutions in a multi- 
dimensional setting where both preferences and feasibility constraints are truly multi-dimensional. 
The settings considered by Myerson |24j are single-dimensional, and the settings considered by 
Alaei et al. [2] have single-dimensional feasibility constraints. While the settings considered by Cai 
et al. [8] do technically have multi-dimensional feasibility constraints, their results are enabled by 
a simple reduction to a setting with single-dimensional feasibility constraints (namely, allocating 
each item independently of the others). Our results are the first of their kind that apply in a truly 
multi-dimensional setting, without single-dimensional feasibility constraints or reductions to such 
settings. 

1.2 Our Approach and Intermediate results 

Since receiving attention from computer scientists, several special cases of the MDMDP have been 
solved computationally efficiently by linear programming [9l 115] . Simply put, these algorithms 
explictly store a variable for every possible bidder profile denoting the probability that bidder i 
receives item j on that profile, and write a linear program to maximize expected revenue subject 
to feasibility and BIC constraints. Unfortunately, the number of variables required for such a 
program is exponential in the number of bidders, making such an explicit description prohibitive. 
More recent solutions have used the reduced form of an auction [21 [8] to sidestep this curse. The 
reduced form of an auction was first studied in [5| 121^122] and contains, for every bidder i, for every 
type A of bidder i, and every item j, the probability that bidder i receives item j when truthfully 
reporting type A over the randomness of the auction and the randomness in the other bidders' 
types, assuming they report truthfully. Indeed, the reduced form auction contains all the necessary 
information to verify that an auction is BIC when bidders are independent, although verifying its 
feasibility (i.e. verifying whether a feasible mechanism exists matching these probabilities) appears 
to be difficult. Despite this difficulty, computationally efficient separation oracles were discovered 
for independent bidders and a single item [21 [8]. The techniques of [2] also accomodate many copies 
of the same item and a matroid constraint on which bidders may simultaneously be served. In 
this paper, we step far beyond both existing results and consider reduced forms in settings with 
arbitrary feasibility constraints. Surprisingly, we are able to provide a simple proof of a strong 
characterization result: for arbitrary feasibility constraints, every feasible reduced form can be 
implemented by a distribution over virtual VCG allocation rules. Our proof is in Section [3] and 
follows the spirit of [U E] '■ we examine the region of feasible reduced forms (we show it is always a 
polytope) and identify special structure in the extreme points of this region. 

9 To see why the feasibility constraints of this setting are single-dimensional, introduce an item for each possible 
customization of the car. Then, if an allocation of customized cars to bidders is feasible, any re-customization of 
these cars is still feasible. 
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In Section[3]we provide a separation oracle for feasible reduced forms, as well as a decomposition 
algorithm to explicitly write any feasible reduced form as a distribution over virtual VCG allocation 
rules in all settings, given only black box access to an implementation of VCG with respect to T . 
In order to make these algorithms exact in all settings, we must use time polynomial in \D\, making 
them practically unusable. In Section (5[ we show how to e-implement both the separation oracle 
and decomposition algorithm in time polynomial in YaLi 1^1 an d l/ e with high probability. By e- 
implementing a separation oracle and decomposition algorithm for polytope P, we mean computing 
a polytope P' such that every point in P is within e/poly(n I^M) (i n ^oo distance) of a point in 
P' and vice versa, and exactly implementing a separation oracle and decomposition algorithm for 
P'. We show that this is sufficient for computationally efficiently deciding whether a reduced form 
that is e-far (in ^oo) from the boundary of P lies inside P, as well as for computing a distribution 
over virtual VCG allocation rules that is within e/poly(n^™ 1 \T>i\) (in distance) of any given 
feasible reduced form that is e-far from the boundary of P. 

In Section [6l we show how to combine the linear programs from [8j [15] with our algorithms for 
reduced forms to obtain an FPRAS for MDMDP using only black box access to an implementation 
of VCG for J 7 . In generic cases, the runtime is polynomial in the number of items and YITLi 
(but not \T>\). In many settings (e.g. when there is correlation among item values, or when the 
value distributions have sparse supports) this is the natural description complexity of the problem, 
and several recent algorithms [H El HI El US] have the same computational complexity, namely 
polynomial in the number of bidders, the number of items and the cardinality of the support of 
each bidder's value distribution. Additionally, by using results of [15j we can reduce the runtime to 
polynomial in only the number of items and number of bidders in item-symmetric settings, as well 
as extend our solution to distributions with infinite support. Our mechanisms can be made interim 
or ex-post individually rational without any difference in revenue. We are also able to naturally 
accommodate hard budget constraints in our solutions. The simple modification that is necessary 
is shown in Appendix ITU 

Finally, in Section we prove a characterization result for correlated bidders. To do this, we 
introduce the notion of a second-order reduced form, and show that every second-order reduced 
form can be implemented as a distribution over second-order VCG allocation rules. With this 
modification, all the related techniques of Section [4] also apply to correlated bidders. 

2 Preliminaries and notation 

We denote the number of bidders by m, the number of items by n, and the type space of bidder i 
by Tj. To ease notation, we sometimes use A (B, C, etc.) to denote the type of a bidder, without 
emphasizing whether it is a vector or a scalar. The elements of XjTj are called type profiles, and 
specify a type for every bidder. We assume type profiles are sampled from a distribution T> over 
XjTj. We denote by T>i the marginal of this distribution on bidder z's type. For independent 
bidders, we use to denote the marginal of V over the types of all bidders, except bidder i. 
For correlated bidders, we use T>-i(vi) to denote the conditional distribution over the types of all 
bidders except for i, conditioned on bidder i's type being Vi. We use ti for the random variable 
representing the type of bidder i. So when we write Pr[ij = A], we mean the probability that bidder 
i 7 s type is A. In Appendix [Bj we discuss how our algorithms access distribution V. 

We let A = [m] x [n] denote the set of possible assignments (i.e. the element denotes that 
bidder i was awarded item j). We call (distributions over) subsets of A (randomized) allocations, 
and functions mapping type profiles to (possibly randomized) allocations allocation rules. We call 
an allocation combined with a price charged to each bidder an outcome, and an allocation rule 
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combined with a pricing rule a (direct revelation) mechanism. As discussed in Section [H we may 
also have a set system T on A (that is, a subset of 2^), encoding constraints on what assignments can 
be made simultaneously by the mechanism. T may be incorporating arbitrary demand constraints 
imposed by each bidder, and supply constraints imposed by the seller, and will be referred to as 
our feasibility constraints. In this case, we restrict all allocation rules to be supported on T . 

The reduced form of an allocation rule (also called the interim allocation rule) is a vector 
function ir(-), specifying values irij(A), for all items j, bidders i and types A G Tj. Ttij(A) is the 
probability that bidder i receives item j when truthfully reporting type A, where the probability is 
over the randomness of all other bidders' types (drawn from D_j in the case of independent bidders, 
and T>_i(A) in the case of correlated bidders) and the internal randomness of the allocation rule, 
assuming that the other bidders report truthfully their types. Sometimes, we will want to think of 
the reduced form as a n^ J ™ =1 1^1 -dimensional vector, and may write 7? to emphasize this view. 

Given a reduced form ir, we will be interested in whether the form is "feasible" , or can be "imple- 
mented." By this we mean designing a feasible allocation rule M (i.e. one that respects feasibility 
constraints T on every type profile with probability 1 over the randomness of the allocation rule) 
such that the probability Mij(A) that bidder i receives item j when truthfully reporting type A is 
exactly TTij(A), where the probability is computed with respect to the randomness in the allocation 
rule and the randomness in the types of the other bidders, assuming that the other bidders report 
truthfully. While viewing reduced forms as vectors, we will denote by F(T, T>) the set of feasible 
reduced forms when the feasibility constraints are J- and consumers are sampled from T>. 

A bidder is additive if her value for a bundle of items is the sum of her values for the items in that 
bundle. If bidders are additive, to specify the preferences of bidder i, we can provide a valuation 
vector Vi, with the convention that Vij represents her value for item j. Even in the presence of 
arbitrary demand constraints, the value of additive bidder i of type Vi for a randomized allocation 
that respects the bidder's demand constraints with probability 1, and whose expected probability 
of allocating item j to the bidder is ir^, is just the bidder's expected value, namely ^ - -ity. The 
utility of bidder i for the same allocation when paying price pi is just J2j v ij ' ^ij ~Pi- Such bidders 
whose value for a distribution of allocations is their expected value for the sampled allocation are 
called risk-neutral. Bidders subtracting price from expected value are called quasi-linear. 

Throughout this paper, we denote by OPT the expected revenue of an optimal solution to 
MDMDP. Also, most of our results for this problem construct a fully polynomial-time randomized 
approximation scheme, or FPRAS. This is an algorithm that takes as input two additional param- 
eters e, rj > and outputs a mechanism (or succinct description thereof) whose revenue is at least 
OPT — e, with probability at least l — rj (over the coin tosses of the algorithm), in time polynomial 
in nJ2i \Ti\, 1/e, and log(l/r/). 

Finally, some arguments will involve reasoning about the bit complexity of a rational number. 
We say that a rational number has bit complexity b if it can be written with a binary numerator 
and denominator that each have at most b bits. Also, for completeness, we define in Appendix lAl 
the standard notion of Bayesian Incentive Compatibility (BIC) and Individual Rationality (IR) of 
mechanisms for independent bidders, and state a well-known property of the Ellipsoid Algorithm 
for linear programs. 

3 Characterization of Feasible Reduced Forms 

In this section, we provide our characterization result, showing that every feasible reduced form can 
be implemented as a distribution over virtual VCG allocation rules. For space considerations, all 
proofs of this section are in Appendix In the following definition, VCGjr denotes the allocation 
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rule of VCG with feasibility constraints T . That is, on input v = (v*i, . . . , v m ), VCGjr outputs the 
allocation that VCG selects when the reported types are v. 

Definition 1. A virtual VCG allocation rule is defined by a collection of weight functions, /j : 
Tj — > M n . fi maps a type of bidder i to a virtual type of bidder i. On any type profile v, the 
virtual VCG allocation rule with functions {/i}jg[ m | runs VCGjr on input (fi(vi), . . . , / m (^m))o 
VVCGjr({fi} ie ^ m j) denotes the virtual VCG allocation rule with feasibility constraints T and weight 
functions {fi} ie [ m ]. 

In other words, a virtual VCG allocation rule is simply a VCG allocation rule, but maximizing 
virtual welfare instead of true welfare. It will be convenient to introduce the following notation, 
viewing the weight functions as a (scaled) |Tj| -dimensional vector. Below, denotes the 

jth component of fi. 

Definition 2. Let w G R n ^T=i\ T i\. Define fi so that ft^A) = p r ff A] ■ Then VVCGj{w) is the 
virtual VCG allocation rule VVCGj-({fi}i G [ m j). 

It is easy to see that every virtual VCG allocation rule can be defined using the notation of 
Definition [2] by simply setting Wij(A) = fij(A) ■ Pr[ij = A]. We scale the weights this way only 
for notational convenience (which first becomes useful in Lemma [1]). We say that a virtual VCG 
allocation rule is simple iff, for all v\, . . . , v m , VCGj^(fi(v±), . . . , f m (v m )) has a unique max-weight 
allocation. We now state the main theorem of this section, which completely characterizes all 
feasible reduced forms. 

Theorem 1. Let T be any set system of feasibility constraints, andT> be any (possibly correlated) 
distribution over bidder types with finite support. Then every feasible reduced form (with respect to 
T and T>) can be implemented as a distribution over at most n^iLi l^il + 1 simple virtual VCG 
allocation rules. 

Before outlining the proof, we provide a brief example illustrating the content of Theorem [TJ 
We are not claiming that every feasible allocation rule can be implemented as a distribution over 
virtual VCG allocation rules. This is not true. What we are claiming is that every feasible allocation 
rule has the same reduced form as some distribution over virtual VCG allocation rules. Consider 
a scenario with a single item and two bidders each with two types, A and B that are sampled 
independently and uniformly at random. If M is the allocation rule that awards bidder 1 the item 
when the types match, and bidder 2 the item when they don't, then M cannot be implemented 
as a distribution over simple virtual VCG allocation rules. Because bidder 1 gets the item when 
both types match, we must always have wu(A) > W2i(A) and w\\(B) > W2i(B). Similarly, 
because bidder 2 gets the item when the types don't match we must have W2i(A) > wn(B) and 
W2i(B) > wn(A). Clearly, no weights can simultaneously satisfy all four inequalities. However, 
there is a distribution over simple virtual VCG allocation rules with the same reduced form! 11 ! 
The proof of Theorem [JJ begins with a simple observation and proposition, whose proofs are in 
Appendix [Cl 

Observation 1. An allocation rule is feasible if and only if it is a distribution over feasible deter- 
ministic allocation rules. 

10 If there are multiple VCG allocations, break ties arbitrarily, but consistently. A consistent lexicographic tie- 
breaking rule is discussed in Section \3. II For concreteness, the reader can use this rule for all results of this section. 
"Specifically, the reduced form of M is \ ■ f . If we define w[^(A) = w[\\B) = 1, w$ (A) = w$(B) = 0, and 

(2) ('2) (2) (2) 

w\ 1 (A) = w\ 1 (B) — 0, W21 (A) = W21 (B) — 1, then the allocation rule that chooses uniformly at random between 
VVCGt(w {1) ) and VVCG F {w (2) ) also has reduced form \ ■ 1. 
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Proposition 1. If [D\ is finite, F^J 7 ,!)) is a convex polytope. 

Now that we know that F(F, T>) is a convex polytope, we want to look at the extreme points by 
examining, for any w, the allocation rule of F{J-,T>) whose reduced form maximizes fr-w. LemmaQ] 
and Proposition [2] characterize the extreme points of F(F, T>), which allows us to prove Theorem[TJ 
All three proofs are simple, and provided in Appendix [Cl 

Lemma 1. Let tt be the reduced form of VVCGj^{w) (with an arbitrary tie-breaking rule) when 
bidders are sampled from T>. Then, for all tt' € F(F, T>), tt ■ w > tt' ■ w. 

Proposition 2. Every corner of F(F, T>) can be implemented by a simple virtual VCG allocation 
rule, and the reduced form of any simple virtual VCG allocation rule is a corner of F(J-,T>). 

We conclude this section by providing necessary and sufficient conditions for feasibility of a 
reduced form. The proof is again simple and given in AppendixO For the following statement, for 
any weight vector w £ lR n ^=i l T< l, Wjr{w) denotes the total expected weight of items awarded by 
VVCGf{w) (where we assume that the weight of giving item j to bidder i of type A is fij(A) = 
Wij (A) I Pr [t i = A]). The proof of Lemma [1] implies that the tie-breaking rule used in VVCGj^{w) 
does not affect the value of Wj^(w), and that no feasible allocation rule can possibly exceed Wj^(w). 
The content of the next corollary is that this condition is also sufficient. 

Corollary 1 . A reduced form tt is feasible ( with respect to T and T>) if and only if, for all w £ 
[-1, i \H\ f jf . < Wjr(w). 

3.1 Tie-breaking 

Here we discuss tie-breaking. This is important in later sections because we will want to argue that 
any virtual VCG allocation rule we use is simple. Because we only have black-box access to Aj, 
we do not necessarily have any control over the tie-breaking rule used, which could be problematic. 
Instead, we would like to enforce a particularly simple tie-breaking rule by changing w to w' such 
that VVCGjr{w') also maximizes tt ■ w over all reduced forms in F^, V), and VVCGj^(w') is 
simple. Additionally, we would like the bit complexity of coordinates of w' to be polynomial in 
the bit complexity of coordinates of w. Lemma H] in Appendix O states formally that a simple 
lexicographic tie-breaking rule can be implemented in the desired manner. From now on, whenever 
we use the term VVCGj^(w), we will implicitly assume that this tie-breaking rule has been applied. 
Sometimes we will explicitly state so, if we want to get our hands on w' . 

4 Algorithms for Reduced Forms 

The characterization result of Section [3] hinges on the realization that F^J 7 ,!)) is a polytope whose 
corners can be implemented by especially simple allocation rules, namely simple virtual VCG allo- 
cation rules. To compute the reduced form of an optimal mechanism, we would like to additionally 
optimize a linear objective (expected revenue) over F(F,T>), so we need a separation oracle for 
this polytope. Additionally, once we have found the revenue-optimal reduced form in F(JF, T>), we 
need some way of implementing it. As we know that every corner of F(F,T>) can be implemented 
by an especially simple allocation rule, we would like a way to decompose a given feasible reduced 
form into an explicit convex combination of corners (which then corresponds to a distribution over 
simple virtual VCG allocation rules). In this section, we provide both algorithms. For now, we 
will not worry about the running time of our algorithms, but just provide a generic framework 
that applies to all settings. In Section [5] we will describe how to approximately implement these 
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algorithms efficiently with high probability obtaining an FPRAS with only black box access to an 
implementation of the VCG allocation rule. The same algorithms with the obvious modifications 
also apply to "second-order reduced forms" , using the techniques of Section [7J 

4.1 Separation Oracle 

We know from Corollary [T] that if a reduced form 7? is infeasible, then there is some weight vector 
w G [-l ) l]«Efcil^l suc h that tt ■ w > Wp{w). Finding such a weight vector explicitly gives us 
a hyperplane separating tt from F{JF, T>), provided we can also compute Wj?(w). So consider the 
function: 

g# (w) = Wjr(w) - 7T • w. 

We know that tt is feasible if and only if g^{w) > for all w G [—1, l] n ^;=i l Ti L So the goal of 
our separation oracle SO is to minimize g^{w) over the hypercube, and check if the minimum is 
negative. If negative, the reduced form is infeasible, and the minimizer bears witness. Otherwise, 
the reduced form is feasible. To write a linear program to minimize g^(w), recall that Wj^(w) = 
max£ e ^(jr-p){x • w}, so g^w) is a piece-wise linear function. Using standard techniques, we could 
add a variable, t, for Wj^{w), add constraints to guarantee that t > x ■ w for all x G F(F, D), and 
minimize t — n-w. As this is a burdensome number of constraints, we will use an internal separation 
oracle SO, whose job is simply to verify that t > x ■ w for all x G F(P, T>), and output a violating 
hyperplane otherwise. 

To implement SO, let Rj^{w) denote the reduced form of VVCGj^{w). Then we know that 
Rp{w) ■ w > x ■ w for all x G F^,!)). So if any equation of the form x ■ w < t is violated, 
then certainly Rjr(w) ■ w < t is violated. Therefore, for an input w, t, we need only check a single 
constraint of this form. So let SO(w,t) output "yes" if Rjr{w) ■ w < t, and output the violated 
hyperplane Rj^(w) ■ z — y < otherwise. SO allows us to reformulate a more efficient linear program 
to minimize g^(w). This LP is explicitly shown in Figure Q] of Appendix [Dl 

So our separation oracle SO to check if tt G F{T, T>) is as follows: run the linear program of 
Figured] to minimize g^{w). Let the optimum output by the LP be t*,w*. If the value of the LP 
is negative, we know that w* ■ tt > t* = Wj?(w*), and we have our violated hyperplane. Otherwise, 
the reduced form is feasible, so we output "yes." 

We conclude this section with a lemma relating the bit complexity of the corners of F(J-,D) to 
the bit complexity of the output of our separation oracle. This is handy for efficiently implementing 
our algorithms in later sections. The proof is simple, and provided in Appendix [Dj We make use a 
standard property of the Ellipsoid algorithm (see Theorem 1 10p . 

Lemma 2. If all coordinates of each corner of F(J-,D) are rational numbers of bit complexity 
i, then every coefficient of any hyperplane output by SO is a rational number of bit complexity 
poly(n YT=i \ T i\J)- 

4.2 Decomposition Algorithm via a Corner Oracle 

We provide an algorithm for writing a feasible reduced-form as a convex combination of corners of 
F(J-,T>), i.e. reduced forms of simple virtual VCG allocation rules. A decomposition algorithm for 
arbitrary polytopes P is already given in [8], and the only required ingredients for the algorithm is 
a separation oracle for P, corner oracle for P, and bound b on the bit complexity of the coefficients 
of any hyperplane that can possibly be output by the separation oracle. The goal of this section 
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is to define both oracles and determine b for our setting. But let us first recall the result of [8]. 
Before stating the result, let us specify the required functionality of the corner oracle. 

The corner oracle for polytope P takes as input k (where k is at most the dimension, in our case 
n^2i \Ti\) hyperplanes Hi, ... , H^ (whose coefficients are all rational numbers of bit complexity b) 
and has the following behavior: If no hyperplane intersects P in its interior and there is a corner 
of P that lies in all hyperplanes, then such a corner is output. Otherwise, the behavior may be 
arbitrary. Below is the theorem from [8]. 

Theorem 2. (\B$) Let P be a d- dimensional polytope with corner oracle CO and separation oracle 
SO such that each coefficient of every hyperplane ever output by SO is a rational number of bit 
complexity b. Then there is an algorithm that decomposes any point x € P into a convex combination 
of at most d+1 corners of P. Furthermore, if £ is the maximum number of bits needed to represent 
a coordinate of x, then the runtime is polynomial in d,b,£ and the runtimes of SO and CO on 
inputs of bit complexity poly(d,b,£). 

So all we need to do is define CO and SO, and provide a bound on the bit complexity of 
the hyperplanes output by SO. We've already defined SO and bounded the bit complexity of 
hyperplanes output by it by poly(n^™ 1 |Tj|,^), where £ is the maximum number of bits needed to 
represent a coordinate in a corner of F(T, T>) (see Lemma [2] of Section f4.ip . So now we define CO 
and state its correctness in Theorem [3] whose proof is in Appendix [Dj In the last line, CO outputs 
the weights w 1 as well so that we can actually implement the reduced form that is output. 

Algorithm 1 Corner Oracle for F{J-,T>) 

1: Input: Hyperplanes {w\,h\), (w a , K), a < nY^Li \ T i\- 
2: Set w = Y* =l \wj. 

3: Use the tie-breaking rule of Section 13.11 (stated formally in Lemma H] of Appendix [C]) on w to 
obtain w' . 

4: Output the reduced form of VVCGj^(w'), as well as w'. 



Theorem 3. The Corner Oracle of Algorithm U\ correctly outputs a corner of F(T, T>) contained 
in C\j =1 Hj whenever the hyperplanes Hi, ... , H a are boundary hyperplanes o/F(J r , P) and (lj =1 Hj 
contains a corner. Furthermore, if all coordinates of all Hj are rational numbers of bit complexity 
b, and Pr[ij = A] is a rational number of bit complexity £ for all i,A G Tj, then every coordinate of 
the weight vector w' is a rational number of bit complexity poly(n^2"Li \ Ti\,b,£). 

5 Efficient Implementation of Algorithms for Reduced Forms 

In this section, we show how to approximately implement the separation oracle (SO) of Section H. II 
and the corner oracle (CO) of Section [4.21 efficiently with high probability, thereby obtaining also an 
approximate decomposition algorithm for F(J-, D). We begin by bounding the runtime of an exact 
implementation, showing that it is especially good when T> is a uniform (possibly non-product) 
distribution of small support. As above, Ajr denotes an algorithm that implements the VCG 
allocation rule with respect to feasibility constraints T , and rtjifS) denotes the runtime of Ajr 
when each input weight has bit complexity b. 

5.1 Exact Implementation 

The only tricky step in implementing SO and CO is computing Rjr(w) for a given w. A simple 
approach is to just enumerate every profile in the support of V and check if VVCGp{w) awards 
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bidder i item j. This can be done in time polynomial in the cardinality \T>\ of the support of T>, the 
bit complexity i of the probabilities used by V and rtjr(poly(b, £)), where b is the bit complexity 
of uTs coordinates. So, if b is an upper bound on the bit complexity of the coordinates of the 
weight vectors w for which Rjr(w) is computed in an execution of SO (CO), then SO (CO) can 
be implemented in time polynomial in n |Tj|, \T>\, I, b, c, and rtj?(poly(b, £)), where c is the 
bit complexity of the numbers in the input of SO (CO). Alone, this result is not very helpful 
as we can do much more interesting computations in time polynomial in \T>\, including exactly 
solve MDMDP [15]. The interesting corollary is that when T> is a (possibly correlated) uniform 
distribution over a collection of profiles (possibly with repetition) whose number is polynomial 
in Ji-X^il^l) the runtime of all algorithms of Section |4] becomes polynomial in n^jTjl, c, and 
rtjr(poly(n Y2i l^i|> c ))j where c is the bit complexity of the numbers in the input to these algorithms. 
Corollaries [2] and [3] in Appendix [E] quantify this statement precisely. It is these corollaries that 
enable an efficient approximation for arbitrary distributions in the next section. 

5.2 Approximate Implementation 

Now, we show how to "approximately implement" both algorithms in time polynomial in only 
XXal^ili where \T>i\ is the cardinality of the support of T>. t , using the results of Section [5TT1 
But we need to use the right notion of approximation. Simply implementing both algorithms 
approximately, e.g. separating out reduced forms that are not even approximately feasible and 
decomposing reduced forms that are approximately feasible, might not get us very far, as we could 
lose the necessary linear algebra to solve LPs. So we use a different notion of approximation. We 
compute a polytope P' that, with high probability, is a "good approximation" to F(F, V) in the 
sense that instead of optimizing over F(T, D) we can optimize over P' instead. Then we implement 
both the separation and the decomposition algorithms for P' exactly so that their running time is 
polynomial in n, Y^iLi l^l> c an d rtjr(poly(n YliLi l^il> c ))> where c is the number of bits needed to 
describe a coordinate of the input to these algorithms. 

Approach: So how can we compute an approximating polytope? Our starting point is a natural 
idea: Given an arbitrary distribution T>, we can sample profiles Pi, . . . , Pk from T> independently at 
random and define a new distribution T>' that samples a profile uniformly at random from P\ , . . . , P^ 
(i.e. chooses each with probability 1/k). Clearly as k — > oo the polytope F(T,V) should 
approximate F(J-,T>) better and better. The question is how large k should be taken for a good 
approximation. If taking k to be polynomial in |Tj| suffices, then Section [5.11 (specfically, 

Corollaries [2] and [3] of Appendix [E]) also implies that we can implement both the separation and 
the decomposition algorithms for F(F, V) in the desired running time. 

However this approach fails, as some types may very well have Pr[ij = A] << po iy( n vj" 1 \t \) ' 
Such types likely wouldn't even appear in the support of T>' if k scales polynomially in n Y^iLi l^il- 
So how would then the proxy polytope F(T,V) inform us about F(J-,T>) in the corresponding 
dimensions? To cope with this, for each bidder i and type A £ T{, we take an additional k' samples 
from T)-i and set ij = A. D' still picks uniformly at random from all k + k! l^il profiles. 

Now here is what we can guarantee. In Corollary [6] (stated and proved in Appendix IF .lj) . we 
show that with high probability every n in F(F,T>) has some n' £ F(F,T>') with | 7T — tt'Ioo small. 
This is done by taking careful concentration and union bounds. In Corollary [7] (stated and proved 
m Appendix E2]), we show the converse: that with high probability every tt' £ F(J-, T>') has some 
7? G F(F, V) with | 7T — tt'Ioo small. This requires a little more care as the elements of F(J~, T) f ) are 
not fixed a priori (i.e. before taking samples from T> to define T>'), but depend on the choice of 
D', which is precisely the object with respect to which we want to use the probabilistic method. 
We resolve this apparent circularity by appealing to some properties of the algorithms of Section [4] 
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(namely, bounds on the bit complexity of any output of SO and CO). Finally, in Theorems U] 
and [5] (stated below and proved in Appendix IF.3j) . we put our results together to prove that our 
approximations behave as desired while taking k and k' both polynomial in n^JlLi |Tj|, thereby 
achieving the desired runtime. 

In the following theorems, Algorithm [2] refers to a pre-processing algorithm (see Appendix IF. 3P 
that explicitly chooses k and k', both polynomial in n.^jTj|, so that the polytopes F(F, T>) 
and F(F,T>') are close with high probability. Algorithm [3] refers to a decomposition algorithm 
(Appendix IF. 3ft combining the geometric algorithm of [8] with some bookkeeping to decompose 
any reduced form in F(F, D') into an explicit distribution over simple virtual VCG allocation 
rules. Also in Appendix IF.3I are Corollaries [8] and [9l which show that Theorems [4] and imply that 
using D' as a proxy yields an approximate separation oracle and decomposition algorithm for T>. 

Theorem 4. Given the choice ofk, k' and T>' in Algorithm® the following are true with probability 
at least 1 - e - n ^"=i PilA). 

1. For all 7? G F(F,V), there is a tt' G F(F,V) with |vr - vf' U < e. 

2. For all tt' G F(F,T>'), there is a tt G F(F,T>) with \ jf - vr'^ < e. 

Moreover the separation oracle of Section \4.1\ for feasibility set F and distribution V runs in time 
polynomial inn,Y^iL\\Fi\, 1/e, c, and rtj^(poly(n Y1T=\ l^l> 1°§ V e > c ))> where c is the bit complexity 
of the coordinates of its input. 

Theorem 5. Algorithm^ has the following property on input tt' G F(F,T>') with probability at least 
1 — e -0( - n ^-' i = 1 l Ti l/ e ); Let tt denote the reduced form of the output allocation rule when consumers 
are sampled from T>. Then \ tt — tt'|oo — £• Furthermore, the running time of the algorithm is 
polynomial in n, YllLi l^ili l/ e ; c ; an d rtjr{poly{n Y^Lx | | , log 1/e, c)), where c is the bit complexity 
of the coordinates of its input. 

Remark 1. Observe that, despite the non-canonical way Theorems^ and\£& are stated for FPRASs, 
the dependence of the running time on the approximation error and the failure probability is the 
typical one. Namely, using the stated results as black box we can simultaneously achieve error 
probability at most n and approximation error at most e in time polynomial in logl/r/, 1/e, n, 
Yli\Ti\, c and rtj-(poZy(n ^™ x |Tj|, log 1/e, log log(l/r/), c)), where c is the bit complexity of the 
coordinates of the input to our algorithms. 

6 Revenue-Maximizing Mechanisms 

In this section we describe how to use the results of Section [5] to obtain computationally efficient 
nearly-optimal solutions to MDMDP using only black box access to an implementation of the VCG 
allocation rule. As our notable contribution to obtain these results is the techniques of Sections [3] 
through [5j we only state our results here. In Appendix [G] we provide a high-level overview of how 
to combine our results with the LPs of [El [15] to solve the MDMDP, followed by proofs. In all 
theorem statements, the allocation rule of the mechanism output by our algorithm is a distribution 
over simple virtual VCG allocation rules. There is no special structure in the pricing rule, it is 
just the output of a linear program. As usual, we denote by Ajr an algorithm that implements the 
VCG allocation rule with feasibility constraints F, and denote by rtj^(b) the runtime of Aj when 
each input weight has bit complexity b. We note that the mechanisms output by the following 
theorems can be made interim or ex-post individually rational without any difference in revenue. 
We are also able to accommodate bidders with hard budget constraints in our solutions. The 
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proofs presented in Appendix [G] provide interim individually rational mechanisms without budget 
constraints. Ex-post individual rationality and budgets are discussed in Appendix iHl 

Theorem 6. For all e, n > 0, all D of finite support in [0, l] nm , and all T ' , given T> and black box ac- 
cess to Ajr there is an additive FPRAS for MDMDP. In particular, the FPRAS obtains expected rev- 
enue OPT—e, with probability at least 1— rj, in time polynomial ini, m, n, max i€ [ m ]{|Tj|}, 1/e, log(l/r/) 
and rtjr{poly(nY^iLi \Ti\, log 1/e, log log (1/77), £)), where £ is an upper bound on the bit complexity 
of the coordinates of the points in the support of T>, as well as of the probabilities assigned by 
T>i, . . . ,T> m to the points in their support. The output mechanism is e-BIC, its allocation rule is a 
distribution over simple virtual VCG allocation rules, and it can be implemented in the afore-stated 
running time. 

Theorem 7. For all e,r] > 0, all item- symmetric T> of finite support in [0, l] nm , and all item- 
symmetric JF 1^1 given T> and black box access to Aj, there is an additive FPRAS for MDMDP. 
The FPRAS obtains expected revenue OPT—e, with probability at least l — n, in time polynomial in 
£, m, n c , 1/e, log I/77 and rtjr(poly(n°m, log 1/e, log log(l/r/), £)), where c = maxjj \T>ij \, where [Dij\ 
is the cardinality of the support of the marginal of T) on bidder i and item j, and £ is as in the 
statement of Theorem® The output mechanism is e-BIC, its allocation rule is a distribution over 
simple virtual VCG allocation rules, and it can be implemented in the afore-stated running time. 

Theorem 8. For all e,r/,5 > 0, all item- symmetric T> supported on [0, l] nm and all item- symmetric 
T ' , given T> and black box access to Aj, there is an additive bi-criterion PRAS algorithm for MD- 
MDP with the following guarantee: If C is the maximum number of items that are allowed to be al- 
located simultaneously by T , the algorithm obtains expected revenue OPT- (y/i + Vd)C, with prob- 
ability l — n, in time polynomial in m, n 1 ^, 1/e, log(l/??), and rtj-(po/2/(n 1//<5 m, log 1/e, log log 
In particular, the runtime does not depend on \T>\ at all). The output mechanism is e-BIC, and 
can be implemented in the afore-stated running time. 

Remark 2. The assumption that T> is supported in [0, l] mn as opposed to some other bounded set 
is w.l. o.g., as we could just scale the values down by a multiplicative f ma x' 

This would cause the 

additive approximation error to be et; max . In addition, the point of the additive error in the revenue 
of Theorem^ is not to set e,5 so small that they cancel out the factor of C, but rather to accept 
the factor of C as lost revenue. For "reasonable" distributions, the optimal revenue scales with C, 
so it is natural to expect that the additive loss should scale with C as well. 

7 Characterization for Correlated Biddders 

Here we provide the analogue of Theorem [T] for correlated bidders. We begin by observing that, 
in fact, Theorem Q] already holds for correlated bidders. The reduced form is still well-defined, 
and nothing about the proofs in Section [3] requires independence across bidders. What's wrong is 
that the information contained in the reduced form is not sufficient for correlated bidders. Indeed, 
for independent bidders, the information contained in the reduced form is sufficient to verify both 
feasibility (due to Section [4.ip and bayesian incentive compatibility. For correlated bidders, while 
feasibility can still be verified, bayesian incentive compatibility cannot. This is because in order for 
bidder i to decide whether she wishes to report type A or type B when her true type is B, she needs 
to know the probability of receiving each item if she reports A, conditioned on the fact that the 
remaining bidders are sampled according to the conditional distribution induced by ij = B. This 

12 Distributions and feasibility constraints are item-symmetric if they are invariant under every item permutation. 
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information is simply not contained in the reduced form. To cope with this issue, we first propose 
an extended definition of reduced form for the case of correlated bidders, and a proper analogue of 
a virtual VCG allocation rule. 

Definition 3. A second- order reduced form is a vector valued function ir(-) such that nij(A, B) 
denotes the probability that bidder i receives item j when reporting type A, where the probability 
is taken over the randomness of the mechanism and the other bidders' types, assuming they are 
sampled from and bid truthfully. 

Definition 4. A second-order VCG allocation rule is defined by a collection of second-order 
weight functions Wij : Tj x Tj — > R. Wij maps a reported type of bidder i and true type of bidder i to 
a second-order bid for item j. On any profile v, the second-order VCG allocation rule with weights 
w (denoted SOVCGj^(w)) on input v selects the max-weight feasible allocation using the weights: 

fij(v) = Yl w ij {v i ,B)Pi[v^ <- V-i(B)] 
Ber r 

We say that a second-order VCG allocation rule with weights w is simple, if on every profile v, 
there is a unique max-weight allocation, where the weight for allocating item j to bidder i is fij(v). 
We now quickly observe a connection between second-order VCG allocation rules and virtual VCG 
allocation rules when bidders are independent, and follow with a statement of the analogue of 
Theorem [T] for second-order reduced forms. 

Observation 2. If bidders are independent, then for any second-order weight vector w and virtual 
weight vector w' with w' i j(A) = ^BgTi w ijiA, B), the allocation rules VVCGjr{w') and SOVCGj?(w) 
are identical for all T ' . 

Proof. When bidders are independent, Pi[v-i ^— V-i(B)] = Pr[w_j £>_j] for all B. In addition, 
Pr[u_j <- T>-i] = Pt[v <- V]/Pt[U = v t ]. Therefore, the weight fj used by SOVCGjr{w) on bid 
vector v is just: 

Prjgj-P] ^ 

1 1 n BeT, 

The weight f^ used by VVCG T {w") is: 

fl j = w' ij (v i )/Pr[t i = v i ] 

= fijjv) 
Pr[v^- V] 

So the weights used by VVCGjr{w') are proportional to the weights used by SOVCGj^(w) and 
they will choose the same allocation on every profile. □ 

Theorem 9. Let T be any set system of feasibility constraints, and T> any arbitrarily correlated 
distribution over consumer types with finite support. Then every feasible second-order reduced form 
(with respect to F and V) can be implemented by a distribution over at most YliLi l^i 1 2 + 1 simple 
second-order VCG allocation rules. 
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The proof of Theorem [9] parallels that of Theorem [TJ We begin by observing that Observation Q] 
and Proposition [TJ also hold in this setting and the proofs are identical. We denote the polytope of 
feasible second-order reduced forms with respect to J- and T> by SO(J-,T>). We now characterize 
the corners of S , 0(J 7 , T>), beginning with an analogue of Lemma [TJ 

Lemma 3. Let n be the second-order reduced form of SOVCGj^(w) with respect to T>. Then for 
all 7r' G SO{F,V): 

TT ■ W > 7T • W 

Proof. Consider any allocation rule M with second-order reduced form it" and denote by Mij(v) 
the probability that M awards item j to bidder i on profile v. Then we can expand 7?" • w as: 

j i AeTi BeTi 

= E E E E w v( A > B ) E Pr P- <" V^Wi^; A) 

j i A£TiB&Ti v_i 

= EEE M ^^)- E m 3 (v l ,B)Pr[v^^V^(B)] 

v i j BeTi 

= EEE M *>>/^) 

v i j 

The second line is derived by simply exanding tt'I-(A^B). The third line is derived by deter- 
mining the coefficient for each Mij(v) in the previous line. The final line is derived by replacing 
SseT w ij(vi, B) Pr [■?_,; <— V_i(B)] with fij(v). One should interpret fij(v) to be the weight of 
awarding item j to bidder i on profile v. Therefore, the allocation rule whose second-order reduced 
form maximizes tt" ■ w over all feasible second order reduced forms is simply the one that selects 
the max-weight allocation on every profile, where the weight of awarding bidder i item j on profile 
v is fij(v). This is exactly the allocation rule SOVCGj^(w). □ 

Proposition 3. Every corner in SO{J-,T>) can be implemented by a simple second-order VCG allo- 
cation rule, and the reduced form of any second-order VCG allocation rule is a corner in SO(J- ,T>). 

Proof. The proof is truly identical to that of Proposition [2] after replacing Lemma[T]with LemmaEJ 

□ 

Proof of Theorem^ Again, the proof is identical to that of Theorem [TJ after replacing Proposition [2] 
with Proposition (3[ □ 

We conclude this section with a discussion on the content of Theorems [TJ and [9[ Again, we 
are not claiming that every allocation rule can be implemented as a distribution over second-order 
VCG allocation rules. This is again not true, and the same example from Section [3] bears witness. 
Let's take a step back and view virtual and second-order VCG allocation rules as special cases of 
a more generic type of allocation rule: 

Definition 5. A weight-scaling allocation rule is defined by a dimension, k, and a collection 
of functions {Vij, W / ij}(ij)g[m]xy • Vij rnaps a type of bidder i to a k-dimensional weight vector 
(Ti -> R k ) and maps the remaining types to a k-dimensional scaling vector (xi'^Ti' — > ~R k ). 



17 



On any profile v, the weight-scaling allocation rule with functions {Vij, Wij}ij selects the max-weight 
allocation with weights: 

fijiv) = Vij(vi) ■ Wijiv-i) 

In other words, the weight of awarding bidder i item j is the dot product of two vectors, one 
contributed by bidder i's type, and the other contributed by the rest of the profile. It is not hard to 
see that every deterministic allocation rule can be implemented by a weight-scaling allocation rule of 
dimension Y\i \Tj\. n It is also not hard to imagine that in order to specify arbitrary deterministic 
mechanisms as a weight-scaling allocation rule, dimension JX \T^\ might be necessary. However, 
virtual VCG allocation rules are weight-scaling allocation rules of dimension 1 (and furthermore, 
Wij{v-i) = 1 for all i,V-i), and second-order VCG allocation rules are weight-scaling allocation 
rules of dimension maxj |Tj|. By restricting ourselves to only care about the reduced form or second- 
order reduced form, we have drastically simplified the space of allocation rules. Our characterization 
theorems show that every allocation rule has the same reduced form as a distribution over weight- 
scaling allocation rule of dimension 1, and the same second-order reduced form as a distribution 
over weight-scaling allocation rule of dimension maxj |Tj|. 

A Details from Preliminaries 

We provide a formal definition of Bayesian Incentive Compatibility and Individual Rationality of 
a mechanism for independent bidders and state a well-known property of the Ellipsoid Algorithm. 
For completeness, we provide an additional proposition showing a standard trick that can force the 
Ellipsoid algorithm to always output a corner. 

Definition 6. [15] (BIC /e-BIC Mechanism) A mechanism M is called e-BIC iff the following 
inequality holds for all bidders i and types t^t[ G T{-. 

Et-^-, [U^TuMiin ; t-i))] > Et_.„2j_. [UiijuMiiji ; t-i))] - ev max ■ max 
where: 

• Ui(A, Mi{B ; t-i)) denotes the utility of bidder i for the outcome of mechanism M if his true 
type is A, he reports B to the mechanism, and the other bidders report t—i; 

• ^max is the maximum possible value of any bidder for any item in the support of the value 
distribution; and 

• tt^(A) is the probability that item j is allocated to bidder i by mechanism M if bidder i reports 
type A to the mechanism, in expectation over the types of the other bidders, assuming they 
report truthfully, and the mechanism's internal randomness. 

In other words, M is e-BIC iff when a bidder i lies by reporting r[ instead of his true type T{, she 
does not expect to gain more than et> max times the maximum of 1 and the expected number of items 

13 Specifically, to implement any deterministic allocation rule M, index the possible profiles as Pi, ... , Pk- If on 
profile P a , U =fc Vi, set (Vij) a (vi) — for all j. Similarly, if t-i v-i, set (Wij)a(v-i) = for all j. If t-i = v~i on 
profile P a , set (Wij) a {v-i) = 1. If U ~ Vi, and M awards bidder i item j on profile P a , set (Vij) a (vi) — 1. If tt = 
and M doesn't award bidder i item j on profile P a , set (Vij) a (vi) = —1. Then on any profile, we will have fij(v) — 1 
iff M awards bidder i item j on v, and —1 otherwise. 
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that t[ receives. A mechanism is called BIC iff it is 

We also define individual rationality of BIC/e-BIC mechanisms: 

Definition 7. A BIC/e-BIC mechanism M is called interim individually rational (interim IR) iff 
for all bidders i and types Tj G T,: 

Et^v-AUiin^Uin ■ t-i))]>0, 

where Ui(A, Mi(B ; t-i)) denotes the utility of bidder i for the outcome of mechanism M if his 
true type is A, he reports B to the mechanism, and the other bidders report t-i. The mechanism 
is called ex-post individually rational (ex-post IR) iff for all i, rj and t-i, \Ji(ji^Mi(ji ; t-i) > 
with probability 1 (over the randomness in the mechanism). 

Theorem 10. [Ellipsoid Algorithm for Linear Programming] Let P be a convex polytope in M. d 
specified via a separation oracle SO, and c-x be a linear function. Assume that all coordinates of 
a and b, for all separation hyperplanes a - x < b possibly output by SO, and all coordinates of c are 
rational numbers of bit complexity £. Then we can run the ellipsoid algorithm to optimize c ■ x over 
P, maintaining the following properties: 

1. The algorithm will only query SO on rational points with bit complexity poly(d,£). 

2. The ellipsoid algorithm will solve the Linear Program in time polynomial in d, £ and the 
runtime of SO when the input query is a rational point of bit complexity poly(d,£). 

3. The output optimal solution is a corner of 

Proposition 4. Let a be a d-dimensional vector, whose coordinates are rational numbers of bit 
complexity £\, P be a d-dimensional convex polytope, in which all coordinates of all corners are 
rational numbers of bit complexity £2. Then we can transform a into a new d-dimensional vector b, 
whose coordinates are all rational numbers of bit complexity d{£\ + 1) + {2d 2 + 1)^2 + 1, such that 
x* = argmax^ g p b ■ x is unique. Furthermore, x* is also an element of argmax^ g p a ■ x. 

Proof. Let a, = Pi/qi, where both pi and qi are integers with at most t\ bits. Now change the 
dj's to have the same denominator Q = So etj = p^/Q, where p\ = piUj^qj. Both Q and 

p\ have at most di\ bits. Let now bi = (p^ + 2~( 1+£2 +( 2 ^ 2+1 )'*) / Q . So 6j can be described with 
d(£i + 1) + (2d 2 + 1)£ 2 + 1 bits. 

Now we will argue that, for any rational vector z^O, whose coordinates can be described with 
at most 2^2 bits, b • z ^ 0. Let Zi = ri/si, where both and Sj are integers with at most 2£ 2 bits. 
Now modify Zi to be r^/S, where S = HiSi and r[ = r^j^Si. Both S and r[ have at most 2d£ 2 
bits. Now consider the fractional parts of Q ■ S ■ (b ■ z), which is 

d 
i=l 

But this equals to only when r.; = for all i. Thus, if z / 0, b ■ z 7^ 0. 

Next, we argue that, if x and y are two different vectors, whose coordinates can be described 
with £ 2 bits, b ■ x 7^ b ■ y. This is implied by the above argument, since all coordinates of x — y can 



14 Strictly speaking, the definition of BIC in [15] is the same but without taking a max with 1. We are still correct 
in applying their results with this definition because any mechanism that is considered e-BIC by [15] is certainly 



considered e-BIC by this definition. We basically call a mechanism e-BIC if either the definition in [31 1171 118] (ei> ma x) 
or [15] (eumax J2j ^ijiwi)) holds. 

15 This is well-known, but also proved in Proposition [4] for completeness. 
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be described with at most 2^2 bits. So there is a unique optimal solution to max^ e p b ■ x. Call that 
solution x*. 

Now we show that x* is also an optimal solution for max^ g pa- x. We only need to argue that 
if corner x is not optimal for a, it will not be optimal for b. First, it is not hard to see that for 
corners x and y, if a ■ (x — y) ^ 0, a ■ (x — y) > 2 2<u 2 q • Second, for any corner x, 

d 

\{b-a)-x\<Y, 

i=l 

So if a ■ x > a ■ y, b ■ x is still strictly greater than b ■ y. Thus, x* must be an optimal solution for 
max^ e p a ■ x. □ 

B Input Model 

We discuss two models for accessing a value distribution T>, as well as what modifications are 
necessary, if any, to our algorithms to work with each model: 

• Exact Access: We are given access to a sampling oracle as well as an oracle that exactly 
integrates the pdf of the distribution over a specified region. 

• Sample-Only Access: We are given access to a sampling oracle and nothing else. 

The presentation of the paper focuses on the first model. In this case, we can exactly evaluate the 
probabilities of events without any special care. If we have sample-only access to the distribution, 
some care is required. Contained in Appendix A of [15] is a sketch of the modifications necessary 
for all our results to apply with sample-only access. The sketch is given for the item-symmetric 
case, but the same approach will work in the asymmetric case. Simply put, repeated sampling will 
yield some distribution T>' that is very close to T> with high probability. If the distributions are 
close enough, then a solution to the MDMDP for T>' is an approximate solution for T>. The error 
in approximating T> is absorbed into the additive error in both revenue and truthfulness. 

C Omitted Proofs from Section [3] 

This appendix contains the missing proofs from Section [3l and our tie-breaking lemma (Lemma H]). 
Proof of Observation d - For any feasible allocation rule M, and any type profile v, the (possibly 
randomized) allocation M(v) is a distribution over feasible deterministic allocations. So let M(v) 
sample the deterministic allocation Ai(v) with probability Pi(v). Then M(v) can be implemented 
by uniformly sampling x from [0,1] and selecting Ai(v) iff Ylj<iPj('v) < x — Ylj<iPj(v)- So 
for y £ [0, 1] let M^ v ' denote the deterministic allocation rule that on profile v implements the 
deterministic allocation selected by M(v) when x = y, then M is exactly the allocation rule that 
samples x uniformly at random from [0, 1] and implements the deterministic allocation rule M^ x \ 
So every feasible allocation rule is a distribution over deterministic allocation rules. The other 
direction is straight-forward: any distribution over feasible deterministic allocation rules is still 
feasible. □ 

Proof of Proposition d - It is clear that there are only finitely many deterministic allocation rules: 
there are finitely many choices per profile, and finitely many profiles. So consider the set S that 
contains the reduced form of every deterministic allocation rule that is feasible with respect to J 7 . 
We claim that F(J-, T>) is exactly the convex hull of S. Consider any feasible reduced form tt. Then 



2~(l+4+(2cK 2 +l)-i) 

Q 
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there is some feasible allocation rule M that implements tt. By Observation [Tj M is a distribution 
over deterministic allocation rules, sampling Mj with probability pj. Therefore, if 7Tj denotes the 
reduced form of Mi, we must have tt = Y2iPi^ii so ^ * s i n the convex hull of S. Similarly, if a 
reduced form tt satisfies tt = ^^PiTTi, where 7fj is the reduced form of a deterministic allocation 
rule Mi for all i, the allocation rule that selects Mi with probability pi implements tt. So the space 
of feasible reduced forms is exactly the convex hull of S, which is finite, and hence its convex hull 
is a polytope. □ 

Proof of Lemma \Q The proof is straight-forward once we correctly interpret tt ■ w. Expanding the 
dot product and using that fij(A) = Wij(A)/ Pr[tj = A], we see that: 

TT-W = ^2^2Y^ ^ij(A)Wij(A) 

i j AeTi 
i j AeTi 

If the "weight" of awarding item j to bidder i when her reported type is A is fij(A), then the 
last line is exactly the expected weight of items awarded by an allocation rule whose reduced form 
is tt. The feasible allocation rule that maximizes the expected weight of items awarded simply 
chooses a max-weight feasible allocation on every profile. This is exactly what VVCGj^({fi}i^ m j) 
does, i.e. exactly what VVCGj^(w) does. So the reduced form of VVCGjr(w) exactly maximizes 
x ■ w over all x G F(F, T>). □ 

Proof of Proposition^ We first prove that every corner of F(J-, T>) can be implemented by a simple 
virtual VCG allocation rule. From Proposition HJ we know F(J-,T>) is a convex polytope. So for 
every corner tt G F(F, T>), there is a weight vector w, such that V n ' G F{J-,T>) and tt' ^ 7?, 

w ■ tt > w • tt' . 

So by Lemmad] we know that tt must be the reduced form of VVCGj^(w), as tt maximizes x ■ w 
over all x G F(J-,T>). To see that VVCGj^{w) is simple, assume for contradiction that there is some 
profile with multiple max-weight feasible allocations. Let B denote the allocation rule that chooses 
the exact same allocation as VVCGj^(w) on every other profile, but chooses a different max-weight 
feasible allocation on this profile. Let t?b denote the reduced form of B. By the definition of B, we 
still have ttb ■ w = tt ■ w. Yet, we also clearly have ttb 7^ tt, as they are reduced forms for allocation 
rules that are identical on all but one profile, where they differ. This contradicts the choice of w, 
so VVCGjr{w) must be simple. 

Now we show that the reduced form of any simple virtual VCG allocation rule is a corner of 
F(J-,T>). Let tt be the reduced form of VVCGj^(w). Then for any other tt' G F(J-,T>), we must 
have tt ■ w > tt' ■ w. Otherwise, let tt' denote a feasible reduced form with tt' ■ w > tt ■ w, tt' 7^ tt and 
let M' implement tt'. Then clearly, there is some profile where the allocation chosen by M' differs 
from that chosen by VVCGf(w) and its weight with respect to {/i}jg[ m ] is at least as large as the 
weight of the allocation chosen by VVCGj^(w). As VVCGj^(w) is simple, this is a contradiction. 
Therefore, tt ■ w > tt ' ■ w for all tt' 7^ tt G F(T, T>) and tt is a corner. □ 

Proof of TheoremUk Since F(J-,T>) is a convex polytope (Proposition [T]), by Caratheodory's The- 
orem, we know that every point in F(F,T>) can be written as a convex combination of at most 
n'Y^iLi \Ti\ + 1 corners of F(F,T>). By Proposition [2j we know that every corner of F(F,D) can 
be implemented by a simple virtual VCG allocation rule. Finally, we observe that if the allocation 
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rules Mi implement 7?j, then the allocation rule that samples Mi with probability pi implements 



Proof of Corollary d' As F{T, T>) is a convex polytope, we know that tt € F(J-,D) if and only if 
for all w G [-1, l T *l : 

tv ■ w < max 7? • io. 

7T>eF(T,T>) 

By the definition of VVCGj^(w), the right hand side is exactly Wj^(w). □ 

Lemma 4. Lei w be a weight vector whose coordinates are rational numbers of bit complexity £\, 
and let £2 be such that for all i, A E Ti, Pr[i, = A] is a rational number of bit complexity £2. 
Then the lexicographic tie-breaking rule can be implemented by a simple transformation that turns 
w into w' such that VVCGj^(w') is simple, VVCGj^(w') selects a maximum weight allocation with 
respect to the scaled weights w on every profile, and each coordinate of w' is a rational number of 
bit complexity nl\ YliLi l^il + ( n l^il + 1)^2 + mn + n + 1. 

Proof of Lemma [7}' Let {fi}i denote the weight functions used by VVCGj^(w) (i.e. fij(A) = 
Wij(A)/ Pr[i,; = A]), and rewrite each value fij(A) with a common denominator. Before rewriting, 
each fij(A) was a rational number of bit complexity £\ + £2- After rewriting, the numerator 
and denominator of each fij(A) has at most b = n(YliLi\Ti\)(£i +£2) bits. Now define new 
weight functions {f[}i such that flj(A) is equal to fij(A) except that 2~ b ~ m ~^~~ l is added to its 
numerator. In going from {fi}i to the numerator of the weight of any allocation goes up 

by at most 2~ b ~ n ~ l , not enough to make an allocation optimal if it was suboptimal. Moreover, 
notice that the last mn bits of the numerator are in one-to-one correspondence with possible 
allocations. So even if there were any ties before, there can't be any ties after the transformation, 
and ties will be broken lexicographically. It is also obvious that we have only added nm + n + 1 
bits to each numerator of fij(A), and therefore w' i j(A) = f-j(A)Pr[ti = A] has bit complexity 
b + nm + n + 1 + £ 2 = n£ x j^Zi l r ;l + ( n IXi \ T i\ + l)4a +mn + n + l. □ 

D Omitted Details from Section [4] 
D.l Separation Oracle 




Variables: 

• t, denoting the value of Wj^(w). 

• Wij(A) for all bidders i, items j, and types A G T,,. 
Constraints: 

• — 1 < Wij(A) < 1 for all bidders i, items j, and types A £ Ti, guaranteeing that the weights 
lie in [-l,l]»E£iPll. 

• SO(w,t) = "yes," guaranteeing that t > Wj^(w) 
Minimizing: 

• t — 7? • w (this is gjf(w) provided t = Wjr{w)). 



Figure 1: A Linear Program to minimize g^{w). 
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Proof of Lemma® The dimension of the LP (shown in Figure [T]) used to run SO is n'Y^iLi l^il- 
Every constraint of the linear program that is not part of SO has bit complexity 0(1), and the 
coefficients of every hyperplane output by SO have bit complexity I by our hypothesis. (Recall our 
discussion in Section [3, 1[ ) So by the theory of Gaussian elimination, the coordinates of all corners 
of the LP of Figure Q] are rational numbers of bit complexity poly(n^™ 1 |Tj|,^). Now recall the 
third property of the Ellipsoid algorithm from Theorem [TUJ □ 

D.2 Decomposition Algorithm 

Before giving the proof of Theorem [3] we present a useful lemma. 

Lemma 5. Let P be a polytope and Hi, . . . , Hi be i hyperplanes of the form wj ■ v = hj such that 
every point x £ P satisfies x ■ Wj < hj. Then for any c\, . . . ,a > 0, any a £ P satisfying: 



a • I > CjWj 



is in n l j =l Hj. 



Proof. Because all a € P satisfy a ■ Wj < hj for all j and Cj > Vj, the only way to have 
a • (j2j c j^j) = Ylj c jhj is to have a • CjWj = Cjhj for all j. Thus, we must have a - Wj = hj for all 
j, meaning that a € djHj. □ 

Proof of Theorem® Let jf, w' denote the output of the corner oracle. First, we observe that if 
H\, . . . ,H a intersect inside F(J-,T>), there is some reduced form j? satisfying jf' ■ Wj = hj for all 
j. Therefore, such a reduced form must also satisfy n' ■ w = \ YTj=i^j- Second, as no feasible 
reduced form can have v-ulj > hj, we also get that no feasible reduced form has v-w>^ YTj=i hj- 
Putting these two observations together, we see that there exists a jf with jf ■ w = ^X^=i^i; 
and this is the maximum over all feasible reduced forms. Therefore, the reduced form Rjr{w) 
of VVCGjp(w) necessarily has Rj?(w) ■ w = \ YTj=ihj- Lemma H] tells us that tt is the reduced 
form of a simple virtual VCG allocation rule VVCGj^(w'), which also maximizes x ■ w over all 
feasible reduced forms x. Therefore, jf ■ w = \ X^/=i hj an d by Lemma El jf is in r\j =1 Hj. From 
Proposition [21 we know jf is a corner. As each coordinate of Wj is a rational number of bit 
complexity b, and a < n s ?2 l ™ =l |Tj|, we see that each coefficient of w is a rational number of bit 
complexity poly(log(n YliLi l^il)> Lemma H] then guarantees that each coefficient of w' is a 
rational number of bit complexity poly(n^™ 1 \Ti\,b,£). □ 



E Proofs Omitted From Section I5.lt Exact Implementation 

We bound the running time of the algorithms of Section [J] when T> is a, possibly correlated, uniform 
distribution. Before doing this, we establish a useful lemma. 

Lemma 6. For all J- and T>, if every corner of F{T, T>) is a vector of rational numbers of bit 
complexity b, the probabilities used by T> have bit complexity i, and SO 's input jf is a vector of 
rational numbers of bit complexity c, then the following are true. 

1. The separation oracle SO of Section \4-l\ can be implemented to run in time polynomial in 
nY^iL\\Ti\, b, c, t, \D\, and rtjr(poly(nY^iLi\Ti\,b,c,£)). Furthermore, the coefficients of 
any hyperplane that can be possibly output by SO have bit complexity poly(n^2iLx \Ti\,b). 
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2. If the corner oracle CO of Section \4-2\ only takes as input hyperplanes output by SO, it can be 
implemented to run in time polynomial inn^iLi\Ti\, b, I, \D\, andrt^ {poly(nY^iLi\Ti\,b,£)) . 

Proof. We first bound the runtime of SO, using Theorem [TUJ The separation oracle is a lin- 
ear program with 1 + n Y2iLi 1^1 variables, 2n^^!_ 1 |Tj| constraints, and an internal separation 
oracle SO. SO on input (w, t) simply checks if Rjr(w') ■ w, where w' is the perturbation of w 
according to Lemma HJ is smaller than or equal to t. If not, it outputs the separation hyper- 
plane (Rj^(w'),—l)(w,y) < 0. Given that Rj^(w') is a corner of the polytope and corners have 
bit complexity b, Theorem [10] tells us that SO will only be called on w, t whose coordinates are 
rational numbers of bit complexity at most poly(n^™ 1 |Tj|, max{6, c}). To compute Rjr(w') ex- 
actly we can enumerate every profile in the support of T>, run VVCGj^{w'), and see if bidder i 
was awarded item j, for all i,j. As the coordinates of w' are rational numbers of bit complexity 
poly(n YliLi 1^1' m a x {&) c, £}) (after Lemma H was applied to w)), this method exactly computes 
Rf{w') in time polynomial in raXX=i \Ti\, b,c, £ and rt^(poly(n i \Ti\,max{b,c,£})). After 
computing Rj?(w'), SO simply takes a dot product and makes a comparison, so the total runtime 
of SO is polynomial in n^™L l \Ti\, b, c,£, \T>\ and rtjr(poly(n^2™ =1 |Tj|,max{6, c, £}))■ Also, by 
Lemma [21 we know that all hyperplanes output by SO have coefficients that are rational numbers 
of bit complexity poly(n^™ =1 |Tj|, b), which is independent of c. 

The corner oracle of Section l4~2l has three steps. The first step is simply computing the average of 
at most nY^iLi \Ti\ vectors in R n 52T=i Pi | 

whose coordinates are rational numbers of bit complexity 
poly(n YliLi l^il> b) (by the previous paragraph). The second step is applying Lemma 0] to the 
averaged weight vector to get w' . So each weight of w' is a rational number of bit complexity 
poly(n^™ 1 \Ti\,£,b). The last step is computing Rjr(w'). It is clear that the first two steps can be 
implemented in the desired runtime. As the coordinates of w' are rational numbers of bit complexity 
poly(n YliLi \Ti I ) b,£) , we can use the same method as in the previous paragraph to compute Rj^(w') 
in time polynomial in n^™ x |Tj|, \V\, b,£ and rfjr(poly(n YliLi \Ti\,b,£)), to implement CO in the 
desired runtime. □ 

Corollary 2. For all J 7 , ifD is a (possibly correlated) uniform distribution over k profiles (possibly 
with repetitions), and SO's input n is a vector of rational numbers of bit complexity c, then the 
following are true. 

1. The separation oracle SO of Section \4-l\ can be implemented to run in time polynomial in 
nY^lLi\Ti\, k, c and rtjr [poly [n'Y^X^Ti], log k,c)). Furthermore, the coefficients of any hy- 
perplane that can be possibly output by SO have bit complexity poly(nY^iL\ \T%\Aogk). 

2. If the corner oracle CO of Section \4-2\ only takes as inputs hyperplanes output by SO as input, 
it can be implemented in time polynomial in n^™ =1 |Tj|, k, and rtj?(poly(n^™L 1 \Ti\, log k)). 

Proof of Corollary^ Every corner of F(J-,T>) is the reduced form of a deterministic mechanism. 
So let us bound the bit complexity of the reduced form ir of a deterministic mechanism M. We 
may let nij(A) denote the number of profiles (with repetition) in the support of T> where bidder z's 
type is A, and M awards item j to i, and let dij(A) denote the number of profiles where bidder i's 
type is A. Then for all i,j,A £ Tj, -Kij(A) = ^' J ^j . As nij(A) and dij(A) are integral and at most 
k, TTij(A) has bit complexity O(logfc). So we may take b = 0(log/c), £ = 0(logk), \V\ = k, and 
apply Lemma O □ 

Next we bound the running time of the decomposition algorithm. 
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Corollary 3. For all T , if T> is a (possibly correlated) uniform distribution over k profiles (pos- 
sibly with repetitions), then given a reduced form tt € F^J 7 ,!)), which is a vector of rational num- 
bers with bit complexity c, we can rewrite it as a convex combination of corners of F{F,V) us- 
ing the geometric algorithm of Theorem^ with running time polynomial in nY^ZLi 1^1 > ^> c an< ^ 
rtjr(poly{nYT=i \ T i\^gk,c)). 

Proof of Corollary 0- From Corollary [2] it follows that the coefficients of any hyperplane that can 
be possibly output by SO have bit complexity poly(n^^i 1 |Ti|, log So to apply Theorem [2] it 
suffices to bound the running time of SO and CO on vectors of rational numbers of bit complex- 
ity c' = poly(n^^. 1 1 Tj |, log k, c). Using Corollary [2] this is polynomial in ^X^il^il; ^, c and 
rtj-(poly(n YliL 1 1 | , log A;, c) ) . Combining this bound with Theorem [2] finishes the proof. □ 



F Proofs Omitted From Section 15.21 : Approximate Implementa- 
tion 

Notation We will use the following notation throughout this section: k is the number of samples 
taken directly from D, k! is the number of samples taken from T>-i after fixing t » = A for all i, A £ Tj, 



k" is the total number of samples taken (i.e. k" = k + k''^2 i |Tj|) and T>' is the distribution that 
samples one of the k" sampled profiles uniformly at random. We also make use of the following, 
standard Chernoff bound: 

Theorem 11. (Hoeffding ]19j) Let X%, . . . ,X n be independent random variables in [0, 1], and let 
X = Y,i Xi/n. Then Pr[|X - E[X]| > t) < 2e~ 2t2n . 

F.l Every point in F(iF,V) is close to some point in F(LF,V) 

The desired claim is stated at the end of the section as Corollary [6j and the proof is obtained by a 
series of small technical lemmas. Throughout this section, we will be interested in whether a profile 
with ti = A in the support of T>' is genuine from the perspective of bidder i (i.e. it was sampled 
from T> without conditioning on anything except perhaps ti = A) or biased (i.e. it was sampled by 
conditioning on ty = A' for some i! ^ i). Now let us fix an allocation rule M with reduced form tt if 
bidders are sampled from T>. What does the reduced form of M look like for bidders sampled from 
T>'1 The expected probability (over the randomness in the types of the other bidders) that bidder 
i receives item j conditioning on ti = A on a genuine from bidder i's perspective profile is exactly 
TTij(A). However, if the profile is biased, the probability that bidder i receives item j might have 
nothing to do with irij(A). So for a fixed D', we'll let Gi(A) denote the (random) set of profiles in 
the support of V with ti = A that were obtained genuinely from the perspective of bidder i, and 
Bi(A) denote the set of profiles with ti = A that are biased from the perspective of bidder i. 

Lemma 7. Fix i and A £ Ti, let M be any allocation rule, and let T>' be such that \Gi{A)\ = x 
and \F>i(A)\ = z (i.e. condition on |Gj(^4)| = x, \F>i(A)\ = z and then sample T>'). Then over the 
randomness in generating T>' , for all items j and all t < 1, if tt' denotes the reduced form of M 
when bidders are sampled from T>' and tt denotes the reduced form of M when bidders are sampled 
from T>, we have: 



Pr 



\TT ij (A)-TT , lj {A)\>t+^ 



< 2e~ 2t2x . 



Proof. Label the profiles in Gi(A) as P±,...,P X and F>i{A) as P x +i, ■ ■ ■ ,P X + Z , and let X a be the 
random variable denoting the probability that M awards item j to bidder i on profile P a . Then 
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for all 1 < a < x we have E[X a ] = TVij(A). For all a > x, we have < E[X a ] < 1. As nUA) 
lhil2a X cL, we see that: 



TTijiA) 



— < — - — ^ij(A) < 



X + Z X + z 
X 



X + z 



i=l 



——Try (A) + — ^— < 7tij (A) + — ^— . 



So |E[7r^(^4)] — 7Tjj(A)| < Therefore, the triangle inequality tells us that in order to have 

\wij(A) - vr^.(A)| > t + § > t + we must have ^.(A) - E[vr^(A)]| > t. As tt^(A) is the 
average oi x + z independent trials, by the Hoeffding inequality, this happens with probability at 
most 2e~ 2t x . 



Lemma 8. For any i,A S Ti, \Gi{A)\ > k' . Furthermore, if k > k'^2^^ |Tj/|, for all x < 1 



□ 



we 



Pr 



2x + j)Y1 \TA-\Gi{A)\ 



< 4 e -2^fc'«E,v.l r .'l 



Proof of Lemma\^ The first claim is obvious, as we fix t i = A in exactly k' profiles. For the second 
claim, there are ^'(X^'^i l^i'l) independent chances to get a profile in Bi(A). Each chance occurs 
with probability q = Pr[ij = A]. There are k independent chances to get additional profiles in 
Gi(A), and each occurs with probability q. Therefore, we get that E[|Sj(^4)|] = qk'J2i'=a l^il an d 
E[|Gj(A)|] = k' + qk. Applying the Hoeffding inequality, we get 



Pr 



{BiWl-qk'^lTi, 



> xk'^2\Tii 



< 2e - 2 fc'E lV J^I 5 



and 



Pr 



Gi(A)\ -(k' + qk) 



> xk 



< 2e 



-x 2 k 



Then since k > k' I^Vli by union bound, we get that for any x < 1, with probability at least 

1 — Ae~ 2x k '^'1 we have the following two inequalities: 

\Bi(A)\<qk'J2\ T i'\+ x k'J2\ T i'\ 

\Gi(A)\>k' + m&x{0,(q-x)k} (1) 

which imply the following two inequalities by ignoring one of the positive terms on the right-hand 
side of Equation ([!]): 



\Bi(A)\ <(q + x)^2 -\Gi{A)\ 

|Sj(^4)| < - — ^ • |Gj(^4)| (we only use this when q > x) 



q — x 



k 



(2) 
(3) 
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When q < x + tet, Equation ([2]) gives a better bound. Otherwise, Equation (J3j) gives a better bound. 
As q decreases, the bound from Equation (J2J) only gets better. Likewise, as q increases, the bound 
from Equation (J3]) only gets better. So for any q, one of the bounds will yield: 



\Bi{A)\ < Ux+YjY^\ T v\ ■ \ G 



as desired. □ 



Corollary 4. Let M be any allocation rule and assume k > k'Y^i'^i I^Vl- Then for all items j, 
bidders i, types A G Tj, and all t < 1, if it 1 denotes the reduced form of M when bidders are sampled 
from T>' and tt denotes the reduced form of M when bidders are sampled from D, we have: 



Pr 



< 6e 



-2t*fc' 



-2x 2 k' 



Proof of Corollary^- Lemma[8]says that with probability at least 1— 4e 2x k ^Vil-^'l > 1— 4e 

V is such that |J3i(A)| < (2x + f)EiVi l T *'l ' \ G *( A )\ and ^ k '• For such the bound 

given by Lemma [7] is: 



Pr 



< 2e 



-2t 2 k' 



So after taking a union bound and setting x = t we get the desired claim. □ 

Corollary 5. Let M be any allocation rule and assume k > k'^2^^ |7V|. T/ien zf ff ' denotes the 
reduced form of M when bidders are sampled from V and tt denotes the reduced form when bidders 
are sampled from D, we have: 



Pr 



Ivf-vf'loo >t + (^t + j^j^2\Ti\ 



< 



6n \Ti\e 



-2t 2 k' 



i=l 



Proof of Corollary Use Corollary [H observe that |Tj/| > Yli'^i \Ti'\ f° r au h an d take a union 
bound over all j, i,A G Tj. □ 

Corollary 6. Assume k > k' |Tj|. TTien /or aZZ i < 1, mi/i probability at least 

m 

l-6nJ2\Ti\e- 2t2k '- n ^W lnt , 
i=l 

for every tt G F(F,V), there is a tt' G F(F,V) with \tt - tt'^ <2t + (2t + ^) 

Proof of CorollarylQ Consider an t-i^, cover of F(F, D) such that every point in F(F,T>) is within 

£oo distance t of a point in the cover. There is certainly a cover that uses at most (j) 1=1 
points, as there is a cover of the entire hypercube using this many points. If for every point x in 
the cover, there is a point z G F(F, V), such that \x — z|oo < t + (2t + 4-) £\ |Tj|, then clearly for 
every point tt in F(J r , X>), there is a point vf' G F(F, V) such that |tT - tt'Ioo < 2i + (2t + ^) £\ 
by the triangle inequality. So we simply take a union bound over all e~ n '^ i = 1 \ Ti \ lnt points in the 
cover and apply Corollary [5] to conclude the proof. □ 
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F.2 Every point in F(F,V) is close to some point in F(F,V) 

In the previous section, we showed that for all tt £ F(F, T>) there is a nearby jf' G ^(J 7 , P') using 
the probabilistic method over the choice of V . In this section, we want to show the other direction, 
namely that for any reduced form t? € F(J-,T>') there is a nearby reduced form tt € F(F, T>). 
However, it is not clear how to use the probabilistic method over the choice of V to prove this, as 
for tt' E F(F, T>') the allocation rule that implements tt' is heavily dependent on D', which is the 
object with respect to which we plan to apply the probabilistic method. To go around this circularity 
we show that after fixing k and k', but before selecting T>', there are not too many allocation rules 
that could possibly implement a reduced form that is a corner of F(F,T>'). Specifically, we show 
that the reduced form with respect to T>' of any simple virtual VCG allocation rule is equivalent to 
one whose functions only output rational numbers with bit complexity that only depends on k, k' 
and the dimension. In particular, regardless of T>', there is an a-priori fixed set of allocation rules 
that implement the corners of F(F,T>') whose cardinality depends only on k, k' and the dimension. 
So we can still use concentration of measure to argue that the reduced form tt' of every corner of 
F(F,T>') has a nearby reduced form tt £ F(F,T>). And, as every point in F(F,T>') is a convex 
combination of the corners of F(F,T>'), this suffices to prove the desired claim. Our starting point 
is the following observation. 

Lemma 9. Let tt be the reduced form of a simple virtual VCG allocation with respect to D' . Then 
each TTij(A) is a rational number of bit complexity 0(logk"). 

Proof of Lemma\^ In every simple virtual VCG allocation, the probability that bidder i gets item 
j on profile P is always 1 or 0. Therefore, if ti = A in exactly x profiles, and bidder i receives item j 
in exactly x' of those profiles, TTij(A) = x'/x. As x' < x < k", this value clearly has bit complexity 
0{\ogk"). a 

Given Lemma[9]the corners of F(F,T>') belong to a set of at most (k")°^ n ^i l Ti D reduced forms 
that is independent of T>' . Still the allocation rules that implement those reduced forms may vary 
depending on V . We show that this can be mitigated by appealing to the correctness of the 
decomposition algorithm of Section 14.21 

Lemma 10. Suppose that the allocation rule M implements a corner tt of F(F,T>'). Then there is 
a virtual VCG allocation rule 7yCG({/j} ig [ m ]) whose reduced form with respect to V is exactly tt 
and such that each fi only outputs rational numbers of bit complexity fcfoYliLi 1^1 ^°Sk")> where 
/ c (0 is a polynomial function. 

Moreover, for any input tt S F(F,T>') to the decomposition algorithm of Section \4-S\ for T>' , the 
output decomposition uses simple virtual VCG allocation rules whose weight functions only output 
rational numbers of complexity fc( n ^2iLi \Ti\ logk"). 

Proof of Lemma 1 1 (k For the first part of the theorem, suppose that a corner tt of F(F,T>') is fed 
as input to the decomposition algorithm of Section 14.21 From the correctness of this algorithm it 
follows that the output decomposition consists of a single reduced form, namely tt itselft, as tt is a 
corner of F(F,T>'). The decomposition algorithm of [8] (which is at the heart of the decomposition 
algorithm of Section 14. 2p has the property that every corner used in the output decomposition will 
always be an output of the corner oracle. So let us try to argue that all reduced forms that are 
possibly output by the corner oracle can be implemented by a small set of allocation rules that 
does not depend on T>'. We use the following lemma: 

Lemma 11. On any input tt, the coefficients of the hyperplane output by the separation oracle 
SO of Section \4-l\ using V as the bidder-type distribution are rational numbers of bit complexity 
f s (nY^iLi\Fi\logk"), where f s (-) is a polynomial function. 
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Proof. By Lemma [9J we know that all coordinates of all corners of F(F, T>') can be described using 
at most 0(logk") bits. Lemma [2] now tells us that SO will only output rational numbers of bit 
complexity poly(n X^^Li l^iM°§ '-' 

Now let's go back to the corner oracle. By Lemma [11] the weights input to the corner oracle 
will always be rational numbers of bit complexity f s (n YliLi 1^1 l°g = P°ly( n X^i=i I ^ I , log k"). 
As the number of hyperplanes input to the corner oracle will never be more than nY^Li \Ti\, 
the weights obtained by averaging in step 2 of the corner oracle require at most an additional 
0(log(ra YaLi l^il)) bits- Given the above and that, for all i, A, Pr[tj = A] is a multiple of 1/k" and 
hence has bit complexity 0(log k"), the tie-breaking rule in step 3 of the corner oracle results in a 
weight vector whose coordinates have bit complexity poly(n^™ 1 \Ti\, log k"). As Pr[tj = A] is a 
multiple of 1/k" for all i, A, transforming from the weight vector representation of the simple VCG 
mechanism computed by the corner oracle to the weight function representation adds at most an 
additional Oilogk") bits per weight. 

The second part of the lemma is already implied by the above discussion. As we noted above the 
decomposition algorithm of [8] (which is at the heart of the decomposition algorithm of Section P4.2[) 
has the property that every corner used in the output decomposition will always be an output of the 
corner oracle. And we argued that the corner oracle for T>' outputs simple virtual VCG allocation 
rules whose weight functions only output rationals of bit complexity as bounded above. □ 

Lemma flOl implies that, before we have sampled T>' but after we have chosen k and k' , there is 
a fixed set K. of at most 4 n ^=i l T *l/ c ( n £i=i l T * lo s fe ) simple virtual VCG allocation rules (namely 
those whose weight functions only output rational numbers of bit complexity /c(^Si=i 1^1 1°S 
such that, no matter what T>' is sampled, all corners of F{J-,T>') can be implemented by a simple 
virtual VCG allocation rule in /C. Moreover, the decomposition algorithm of Section [4.21 only uses 
simple virtual VCG mechanisms from /C in its support. This implies the following. 

Corollary 7. Assume k > k' £^ |Tj| and t < 1. Then, with probability at least 

m 

1 _ 6n ^| Ti | e -2t 2 fc'+nE™im|/ c (nE™im|logfc")ln4 ) 
i=l 

the following hold, where f c {-) is a polynomial function: 

1. every vf' G F(F,V) has some tt G F{F,V) with \jf - tt']^ <t + (2t + ^) £\ \T t \; 

2. if n is the reduced form with respect to T> of the distribution over simple virtual VCG allocation 
rules that is output on input tt 1 £ F(F,T>') by the decomposition algorithm of Section \4-2\ for 
V then iTf-Tf'loo <t + (2t + ^)J2i\Ti\. 

Proof of Corollary^ For a fixed simple virtual VCG allocation rule M & K, Corollary guarantees 
that the reduced form of M when consumers are sampled from D, tt(M), and when consumers are 
sampled from V, tt'(M), satisfy: \tt(M)—tt'(M)\ 00 < t + (2t+^) YT=i \ T i\ with probability at least 
1 - Qn J2T=i \Ti\e- 2t2k> . In addition, Lemma [TO] guarantees that \K\ < A^ n ^T=i 1^1 togfc'>E£i l T «l. 
Because this set is fixed a priori and independent of T>' , we may take a union bound over the 
elements of the set to get that the same claim holds for all simple virtual VCG allocation rules in 
K, with probability at least 

m 

1 _ 6nJ^ |Ti|e- 2i2fc, 4 nE -i \ T i\fdnj:T=i miiogfe"). 

i=l 
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We proceed to show (i) and (ii) conditioning on the above. For (i) we use the first part of 
Lemma flQl to get that for all n' € F(F,V), we can write tt' = YaPa??' (M a ), where for all a: 
M a € fC, p a > 0, and J2 a Pa = 1- ^ we consider the exact same distribution over simple virtual VCG 
allocation rules when consumers are sampled from T>, the reduced form will be tt = YaP^i^a)- 
Given that for all M a £ K we have \ir(M a ) -^'(Ma)]^ < t + (2t+^) YT=i \ T i\> we have l^-^'U < 

t + (2t + ^)YT=i\ T i\ as well. 

For (ii) the proof is virtually identical. By the second part of Lemma [TOl the simple virtual VCG 
allocation rules in the support of the decomposition belong to the set /C. We proceed as above. □ 

F.3 Putting Everything Together 
F.3.1 Setting k and k', and sampling V 

Algorithm [2] is a preprocessing algorithm used to set the parameters k and k' that were left free in 
Sections EH and E21 

Algorithm 2 Pre-processing to generate a proxy distribution D' for T>. The desired £oq accuracy 

is e. 

1: Input: V. Denote by T = Ya=i 1-^*1- 

9 . G P w _ £ U - n 2 T 2 f c (nT) , _ 4 fc'T 

3: Build V by sampling k profiles independently from D. For each i,A 6 Tj, fix ij = ^4 and 
sample fc' profiles independently from X>_j. V picks one of the k + k' YaLi l^il sampled profiles 
uniformly at random. 

4: Output t,k,k',V. 



F.3. 2 Separation Oracle for Approximating Polytope F(F, V) 
We provide the proof Theorem |H 

Proof of Theorem^ We use the shorthand T = YXLi l^il- After plugging in our choice of t, fc', /c, 
we see that 2t + (2t + y) X^E=i 1^1 — e - So we just have to verify that the probability bounds given 
by Corollaries [6] and [7] are as desired. 

Before plugging in the choice of t,k',k to Corollary [6l we get that the first claim is true with 
probability at least 1 - 6nT e - 2t2k '- nTlnt . As k! > nT/t 3 , this is at least 1 - 6nT e -( 2 /*- ln ( 1 /*)) nT '. 
As \jt asymptotically dominates ln(l/t) as t — > 0, this probability becomes 

1 _ e -n(nT/ e ) after 

plugging in our choice of t. 

Before plugging in the choice of t, k', k to Corollary [71 we get that the second claim is true with 
probability at least 1 — 6nTe~ 2t k + nT fc{nT\og{k+k ))in4_ pi U gg m g i n the choice of k! and k (and 
observing that log(/c + k') is 0(log (nT/t)) this becomes: 

1 - 6nTe" 2n2T2/c(nT)/ * + riTf c (nT.O(log(nT/t))) ln4_ 

Therefore, the ratio of the absolute value of the negative term in the exponent to the value of the 
positive term is poly ^^^m i so the negative term dominates asymptotically as t — >■ 0. Therefore, 
the entire probability is 1 — e -&( nT / e ) after plugging in our choice of t. 

The bound on the running time follows directly from Corollary [2] and our choice of parameters. 

□ 
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F.3.3 Decomposition Algorithm for Approximating Polytope F(T,V) 

Algorithm [3] describes our decomposition algorithm for F(J-, T>'). After stating it, we analyze it. 



Algorithm 3 Algorithm for decomposing a reduced form tt' G F^J 7 ,!)') into a distribution over 
simple virtual VCG allocations. 
1: Input: T, V, tt" G F(F,V). 

2: Run the geometric algorithm of [8] on F(~F, V) using the separation oracle of Section [4.11 and 
the corner oracle of Section H~2l The output will be a collection of at most n Y2i=i |3^j — f— X corners 
output by the corner oracle. These will be simple virtual VCG allocation rules, whose weight 

\ r \rp. I i J\ 

functions we denote by {/i}i g r m p ■ ■ ■ > {fi} i£ ^ =1 ■ We also denote by pj the probability 
placed on VVCG f{/i}^ m ] ) m the output decomposition. 
3: Output the allocation rule M that runs VVCG f{/i} 4 -g| m i ) with probability pj. 



Proof of Theorem [5|: It follows from the correctness of the decomposition algorithm of [8] that the 
output allocation rule M implements the input reduced form tt' when bidders are sampled from 
V' . Now it follows from Corollary [7] that with probability at least 1 — e~°( n ^=i l Ti l/ e ) (see the 
proof of Theorem [J] for why the probability guaranteed by Corollary [7J is at least this large given 
our choice of parameters) it holds that | tt — tt'Ioo — £ (again see the proof of Theorem [2] for why 
the guaranteed distance is at most e given our choice of parameters). The bound on the running 
time follows directly from Corollary [3] and our choice of parameters. □ 



F.4 Approximate Algorithms for F(IF, V) 

The results of this section are provided for completeness, but are not used elsewhere in the paper. 
Our goal is to use Theorems 0] and [5] to obtain approximate algorithms for the original polytope 
F(J-,T>). Namely, we want to obtain, with high probability, a separation oracle that is correct on 
all points that are e-away from the boundary of F(F, D) (in loo), an d a decomposition algorithm 
that returns a distribution over virtual VCG allocation rules whose reduced form is within e (in -^oo) 
of any given feasible reduced form that is e-away from the boundary. Such algorithms are provided 
by the following two corollaries. 

Corollary 8. Given our choice of k,k' in Algorithm^ a separation oracle for F(F,T>') is an 
approximate separation oracle for F(F,T>). Specifically, with probability 1 — e - ^™^ I^IA), we 
have: 

1. If the entire e-loc ball around x is outside F{T,V), then the separation oracle outputs "no." 

2. If the entire e-too ball around x is inside F(J-,T>), then the separation oracle outputs "yes." 

Proof. By Theorem [U with probability 1 — e~^ n ^^ Ti ^ e \ any point x G F(J-,T>') has some y G 
F(J-,T>) with \x — y\oo < e. Therefore, y is in F(J-,T>) as well as the e-£oo ball around x, and the 
entire e-ioo ball around x is not outside F(J-,T>). Taking the contrapositive proves part 1). 

For any point x ^ F(T, T>'), there is a separation hyperplane H (not going through x) separating 
x from F(J-, T>'). Grow an ball centered at x (i.e. a hypercube) until it hits H. Let y denote one 
point in the intersection of H with the ball. The ball has some positive radius, 5. So the distance 
from x to the closest point in H (y) is 5. Let now z = x + %{x — y). It is clear that the closest 
point (in £oo) in H to z is also y, and the distance is e + 5. Therefore, there is no point within e of 
z in F(T,V). By Theorem H with probability 1 - e~ n{n ^ \ T ^/ e ), this implies that z £ F(T,V). 
As \x — z\oo = e, taking the contrapositive proves part 2). □ 
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Corollary 9. With probability 1 — e — ^(^zJi \ T i\/ e ) i the Algorithmic has the property that for any 
reduced form tt such that the entire e-ioo ball around tt is inside F(F,T>), Algorithm^ outputs a 
distribution over virtual VCG allocation rules whose reduced form tt' satisfies \tt — 7r'|oo < e - 

Proof. Theorem [5] guarantees that with probability 1 — e~ ^ n ^i l T »IA) ; Algorithm [3] is such that 
whenever n is in F(F,T>'), a n' with | tt — tt'|oo — £ is output. Corollary [8l guarantees that with 
probability 1 — e~^ n ^>* \ Ti \/ e \ any tt whose entire e-£oo ball is contained in F(F,T>) is in F(F,T>'), 
which completes the proof. □ 

G Discussion and Proofs from Section [6] 

Approach. Theorems [6] through [8] are obtained similarly to the corresponding Theorems 8 
through 10 of [8]. In |15| . linear programs are provided that exactly solve MDMDP in cases with 
finite support (and less general feasibility constraints, namely each bidder has an upper bound on 
the number of items she wants to receive, and every item should be allocated to at most one bidder). 
However, the proposed LPs maintain variables for every type profile P, denoting the probability 
that bidder i receives item j on profile P, resulting in LP size proportional to \T>\. In [8] it is 
observed that these LPs can be made more efficient by making use of the reduced form, even if just 
a separation oracle is provided for the feasibility of the reduced form. Indeed, the reduced form 
of a mechanism contains sufficient information to verify truthfulness (given the additivity of the 
bidders), and a separation oracle for the feasibility of the reduced form is sufficient to optimize the 
expected revenue of the mechanism by solving an LP. Algorithm [4] and Figure [2] provide the details 
of how we apply this approach in our setting, culminating in a proof of Theorem [6J Simply put, we 
use the LP approach of [8] with the following twist: (a) we use a separation oracle for the proxy 
polytope of feasible reduced forms F(F,V), obtained in Section [5j rather than the real polytope 
F(F,T>); (b) still, we compute expected revenue for bidders sampled from the real distribution D. 

Algorithm 4 FPRAS for solving MDMDP when T> has finite support. 
1: Input: V, F, e. 

2: Set 5 = Run the pre-processing algorithm (Algorithm [2]) on input T>, with accuracy 5/2n. 

Call the output distribution V . 
3: Let SO(tt) be the separation oracle that on input tt executes the separation oracle of Section H~T1 

on input tt for distribution T>' and feasibility constraints F . 
4: Using SO, solve the Linear Program of Figure [2j Store the output as tt, p. 
5: Run the decomposition algorithm (Algorithm ED with input F,V,tt. Store the output as M' . 

M' is a distribution over at most nY^Li l^il + 1 simple virtual VCG allocations. 
6: Output the allocation rule M' and pricing rule p — 5 ■ 1 (i.e. when bidder i reports type A, 

charge her pi (A) — 5) . 



Proof of Theorem^ We use Algorithm [H Using the additivity of the bidders, it follows that StepH] 
of the algorithm outputs a reduced form/pricing rule pair (tt , p) that is revenue-optimal with respect 
to all 5-BIC, IR reduced form/pricing rule pairs, except that the reduced forms that are searched 
over belong to F(F, V) and may be infeasible with respect to T>. We proceed to argue that the 
achieved revenue is nearly-optimal with respect to all BIC, IR reduced form/pricing rule pairs for 
which the reduced form lies inside F(F,T>). So let (tt*,p*) denote an optimal such pair, and let 
OPT denote its expected revenue. By Theorem [H we know that, with high probability, there is 
some reduced form tt' £ F(F,T>') satisfying \tt' — tt*\ 00 < 5/2n. So, if we let ff = p* — | • 1, it is 
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Variables: 

• Pi(vi), for all bidders i and types Vi E Tj, denoting the expected price paid by bidder i when 
reporting type Vi over the randomness of the mechanism and the other bidders' types. 

• Kij(vi), for all bidders i, items j, and types vi S T«, denoting the probability that bidder i 
receives item j when reporting type V{ over the randomness of the mechanism and the other 
bidders' types. 

Constraints: 

• T?i(vi) ■ Vi — Pi(vi) > TTi(wi) -Vi —pi(wi) — 5, for all bidders i, and types Vi, Wi £ Tj, guaranteeing 
that the reduced form mechanism (ir,p) is 5-BIC. 

• Ki{vi) ■ Vi — Pi(vi) > 0, for all bidders i, and types Vi £ Tj, guaranteeing that the reduced form 
mechanism (n,p) is individually rational. 

• SO(if) ="yes," guaranteeing that the reduced form 7? is in F(F,V). 
Maximizing: 

• YliLi Y^v-eT- P r [^ = fy] 'Pi{^i)i the expected revenue when played by bidders sampled 
from the true distribution V. 



Figure 2: A linear programming formulation for MDMDP. 




obvious that the reduced form (tt',/?) is <5-BIC. It is also obvious that it is individually rational. 
Finally, it is clear that value of the LP achieved by (tt',$) is exactly m8/2 = e/4 less than the value 
of (tt*,p*). So because (if',j?) is in the feasible region of the LP of Figure[2j the reduced form/price 
rule pair output by Step H] of Algorithm H] has expected revenue at least OPT — e/4. Noticing that 
we subtract an additional 5 from the price charged to each bidder in Step [6] of the algorithm, we 
get that the end price rule makes expected revenue at least OPT — e. 

We argue next that the mechanism output by Algorithm 2] is e-BIC and IR. To see this let M' 
be the allocation rule (computed in Step [5] of the algorithm), which implements the reduced form 
7? (computed in Stepd]) with respect to V. Let also n' denote the reduced form of M' with respect 
to T>. By Theorem [5l we know that, with high probability, | 7T — tt'Ioo — S/^ti. Therefore, given that 
(tt,p) is 5-BIC, we immediately get that (jr',p — 6 ■ 1) is 25-BIC. So allocation rule M' with pricing 
rule p — 5 ■ 1 comprises an e-BIC mechanism. Also because (tt,P) is IR and | 
immediately get that (tC , p — 5-1) is IR, and hence that allocation rule M' with pricing rule p — 5 ■ 1 
is IR. 

Overall the above imply that the allocation rule M' and the pricing rule p — S-1 output in 
Step [6] of our algorithm comprise an e-BIC and IR mechanism, whose pricing rule achieves revenue 
at least OPT — e. Moreover, it is immediate from Theorems [4] and [5] that Algorithm |4] runs in 
time polynomial in c, n, YllLi \1i\, and rijr(poly(n YliL 1 | | , log 1 /e, c) ) , where c is as in the 
statement of the theorem. 

Finally, the way we chose our parameters in Algorithm H] the probability of failure of the 
algorithm is 1 — e~ n ( mn Si \ T i\/ e ) D y Theorems |4]and[5l Trading off probability of error with e (as 
per Remark [1]) we complete the proof of Theorem On 

Theorems [7] and [8] are obtained by combining Theorem [6] with tools developed in |15| . Both 
theorems are based on the following observation, generalizing Theorem 2 of [15] : If T> is item- 
symmetric and the feasibility constraints T are also item-symmetric then there exists an optimal 
mechanism that is: 

1. item-symmetric, i.e. for all bidders i, all types Vi £ Tj, and all item-permutations a, the 
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reduced form of the mechanism satisfies iti(a(vij) = a{TVi(vi)); this means that the reduced 
form on a permuted type of a bidder is the same permutation of the reduced form of the 
un-permuted type of the bidder. 
2. strongly-monotone, i.e. for all bidders i, and items j and f, Vij > Viji =^> iTij(vi) > -Kiji{vi). 

Using this structural observation for optimal mechanisms, we sketch the proofs of Theorems [7J 

and El 

Proof of Theorem^ (Sketch) Given our structural observation for optimal mechanisms, we can — 
without loss of generality — rewrite the LP of Figure [21 while at the same time enforcing the above 
constraints, i.e. searching over the set of item-symmetric, strongly-monotone reduced forms. In- 
deed, to save on computation we can write a succinct LP on variables {^ijivi^ij^eEi where, for 
every bidder i, Ei is a sufficient (based on the above symmetries) set of representative types, e.g. 
we can take Ei = {vi \ vn > . . . > Vi n }. We refer the reader to [8] for the explicit form of the 
succinct LP. The benefit of the succinct formulation is that \Ei\ < n c , where c is as in the statement 
of Theorem [JJ so the size of the succinct LP is polynomial in m, n c and £, where I is as in the 
statement of the theorem. 

But we also need to come up with an efficient separation oracle for our setting. Checking viola- 
tion of the strong- monotonicity property is easy to do in time linear in 0(mn) and the description 
of {wijivi^ijtf^Ei- So it remains to describe a separation oracle determining the feasibility of a 
succinct description {^(^OKi^e,^ of an item-symmetric reduced form. One approach to this 
would be to expand out the succinct description of the item-symmetric reduced form to a full- 
fledged reduced- form and invoke the separation oracle developed in Sections 0] through However, 
this would make us pay computation time polynomial in ^ i |Tj|, and the whole point of using 
item-symmetries is to avoid this cost. To circumvent this, we take the following approach: 

• First, let Fs{J-,T>) be the set of item-symmetric reduced forms that are feasible with respect 
to J- and T>; 

• Fs{J-,T>) is a polytope, as it is the intersection of the polytope F^T^T)) and the item- 
symmetry constraints; moreover, every point in Fs(J-,T>) has a succinct description of the 
form {^ij(vi)}i,j,v t eEi where, for all i, E{ is defined as above; 

• What are the corners of Fs{J-,T>)l These can be implemented by item-symmetric virtual 
VCG allocation rules whose weight-functions are item-symmetric. The proof of this is identical 
to the proof of Proposition [2] noticing that Fs(J-,T>) lies in the lower-dimensional space 
spanned by the item-symmetries. We note that we do not require the virtual VCG allocation 
rules to be simple in the same sense defined in Section [3l as this could violate item-symmetry. 

• However, given an item-symmetric weight vector w how do we even run an item-symmetric 
virtual VCG allocation rule corresponding to w? There are two issues with this: (a) how 
to enforce the item-symmetry of the virtual VCG allocation rule; and (b) there could be 
multiple item-symmetric virtual VCG allocation rules consistent with w, e.g., if w is per- 
pendicular to a facet of Fs{J-,T>). Here is how we resolve these issues: First, we apply 
Lemma H] to the symmetric weight vector w to get a non-symmetric weight vector w' (we 
may do this transformation explicitly or do a lazy-evaluation of it — this is relevant only for 
our computational results three bullets down). When bidders submit their types, we pick a 
permutation a uniformly at random, and permute the names of the items. Then we use the 
simple virtual VCG allocation rule corresponding to w' . Finally, we un-permute the names of 
the items in the allocation. We denote this allocation rule by S.VVCGj^{w). It is clear that 
S.VVCGp(w) is well-defined (i.e. no tie-breaking will ever be required), is item-symmetric, 
and defines a virtual VCG allocation rule w.r.t. the original weight vector w. Given the 
above discussion, every item-symmetric weight vector w has a succinct description of the 
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form {wij (vityijrfieEii which defines uniquely an item-symmetric virtual VCG allocation rule 
w.r.t. w (namely, S.VVCGj^(w)); 

Given the above definitions and interpretations, we can generalize the results of Sections [3] 
and H] to the polytope Fg^, V). 

Next we discuss how to extend the computationally- friendly results of Section [5] to the item- 
symmetric setting, while maintaining the computational complexity of all algorithms poly- 
nomial in m, maxj \Ei\ = 0{n c ) and £, where £ is as in the statement of the theorem. We 
define an item-symmetric distribution D' as follows: We draw k" = k + k' Yl^Li l-^il profiles of 
bidders Pi, ... , P^n from T> as in Section 02 except that, for each bidder i, we draw k' profiles 
conditioning on the type of the bidder being each element of Ei and not Tj. Then we define 
(without explicitly writing down) V' to be the two-stage distribution that in the first stage 
draws a random profile from Pi, ... , Py/ and in the second-stage permutes the items using 
a uniformly random item-permutation. We claim that using T = poly(mn c ) in Algorithm [2] 
suffices to obtain an analog of Theorems 0] and [5] for our setting with probability of success 
1 - e" 1 ^. (We addr ess the running time shortly.) The reason we can save on the number 
of samples is that we are working with Fg^J 7 ,!)) and hence all reduced forms are forced to 
be item-symmetric. So we need to prove concentration of measure for a smaller number of 
marginal allocation probabilities. 

Unfortunately, we cannot afford to compute reduced forms with respect to T>' , as we can't in 
general evaluate the reduced form of S.VVCGj^{w) on a given type profile without making 
prohibitively many queries to A jr. Instead, we will also independently sample item permuta- 
tions (7i, ... , ct/c", and associate the permutation a a with P a in the following sense. If a given 
type profile was sampled by T>' after sampling P a in the first stage of T>' , we will permute 
the items by (just) a a when evaluating S.VVCGj^(w) on that profile, instead of taking a 
uniformly random permutation. In other words, we have removed the randomness in evalu- 
ating S.VVCGjr{w) and fixed the applied item-permutation to some a a , which was chosen 
uniformly at random. Doing so, we still have the same expectations as in the previous bullet, 
and we can deduce that the reduced form of S.VVCGj^(w) when consumers are sampled from 
T> is very close to the reduced form of S.VVCGj^{w) when consumers are sampled from T>' 
(while only using item-permutation a a to run S.VVCGjr{w) on all profiles sampled from T>' 
after sampling P a in the first stage of V, as explained above). 

With the above modifications, for both Theorems U] and [5] the running time of the corre- 
sponding algorithm can be made polynomial in £', m, n c , 1/e and rtjF(poly(ra c , m, log 1/e, £')), 
where £' is the max of £ (see statement) and the bit complexity of the coordinates of the 
input to the algorithms. To achieve this we only do computations with succinct descriptions 
of item-symmetric reduced forms and weight vectors. What we need to justify further is 
that we can do exact computations on these objects with respect to the distribution D' given 
oracle access to Ajr in the afore-stated running time. For this it suffices to be able compute 
the succinct description of the reduced form tt of S.VVCGj^(w) (using permutation a a on 
profiles coming from P a as explained above). The small obstacle is that the support of T>' 
is not polynomial in the required running time, but it suffices to do the following. For each 
profile P a find the allocation output by the (non item-symmetric) virtual VCG allocation rule 
corresponding to the perturbed vector w' , after relabeling the items according to a a . This 
we can do in the allotted running time with a lazy evaluation of the perturbation. Then, for 
all i, Vi and j, to compute irij(vi) do the following: for all profiles P a , let x a (i,iJi) denote the 
number of r such that r(uj) matches the type U(a) of bidder i in P a . Let y a {i, Vi) denote the 
number of r such that r(vi) matches the type of bidder i in P a , and item r(j') is awarded to 
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bidder i. It is easy to compute x a (i, v{): let J* = {j\vij = v}, and J% be the set of items that 
bidder i values at v in profile P a . Then if \ J^\ / \ J%\ for any u, x a {i,Vi) = 0. Otherwise, 
Xa(i,Vi) = Y\ v \Jy\\, because r(vi) matches the type of bidder i iff r maps all of J* to for 
all v. Computing y a (i, V{) is also easy: simply break up into two sets: J^(W) of items that 
bidder i values at v and wins, and J%(L) of items that bidder i values at v and loses. Then 
if x a (i,Vi) / 0, y a (i,Vi) = |J^.(W)| • (|J^.| - 1)! • l\ v ^ Vij \ J%\h because bidder i is awarded 
item r(j) and r(vi) matches the type of bidder i iff r(j) G Jjf (W), and r maps J~ to J% for 
all Once we've computed x a (i, Vi) and y a (hVi), it is easy to see that: 



The above bullet points explain briefly how our ideas from the previous sections are modified for 
item-symmetric distributions. The complete details are omitted. □ 

Proof of Theorem^ We combine Theorem [7] with (i) a discretization of the hypercube so that every 
v in the support of the distribution has Vij = kd, k G N, for all and (ii) the approximately-BIC 
to BIC reduction of Section 6 of [15] . informally stated below. 

Informal Theorem 3. (Reworded from 115)1 ) Let C = x^Cj and C' = XjC^ be product distributions 
sampling every additive bidder independently from [0, l] n . Suppose that, for all i, Ci and C[ can be 
coupled so that with probability 1, Vi sampled from Ci and sampled from C[ satisfy > v'^ > Vij—5 
for all j. If M' is any e-BIC mechanism for C , then with exact knowledge of the reduced form 
of M 1 with respect to C , we can transform M' into a BIC mechanism for C while only losing 
revenue, where C is the maximum number of items that are allowed to be allocated 
simultaneously. In item symmetric settings, the reduction runs in time polynomial in n l / & ,m. 

While doing the discretization is straihgtforward, there is an issue with applying the afore- 
mentioned approximately-BIC to BIC reduction. To directly apply the reduction in our setting, 
one might try to take C to be the V from the sampling procedure. Unfortunately, this doesn't 
work because V is correlated across bidders. Instead, we might try to take C to be V. This too 
doesn't work because we can't exactly compute the reduced form of a mechanism with respect to 
T>. We can, however, compute the reduced form of a mechanism with respect to T> with quite good 
accuracy. So we will prove a quick lemma about the quality of the reduction proposed in [15] in our 
setting. Virtually the same lemma is used in [T7] where the ideas behind this reduction originated, 
but in a different setting. 

Lemma 12. Let C and C satisfy the hypotheses of Theorem 4 in }15f , and let M' be a j-BIC 
mechanism whose reduced form with respect to C is tt' (which is possibly unknown). Then with 
knowledge of some n such that \tt — tt'\i < e, the reduction of I15f transforms M' into a 2e-BIC 
mechanism for C while only losing 0{C{y/5 -\- ^f)) revenue, where C is as above. 

Proof of Lemma \12[ We avoid repeating a complete description of the reduction and refer the 
reader to [15] for more details. At a high level, the reduction is the following. The new mechanism 
M is a two-stage mechanism. First, each bidder i, independently from the other bidders, plays a 
VCG auction against make-believe replicas of herself, drawn independently from Ci, to purchase 
a surrogate from a collection of surrogates drawn independently from C[. (The items of the per- 
bidder VCG auction are the surrogates and, in particular, they have nothing to do with the items 
that M is selling. Moreover, the feasibility constraints of the VCG auction are just unit-demand 
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constraints on the bidder-side and unit-supply constraints on the item-side.) After each bidder buys 
a surrogate, the purchased surrogates play M' against each other. Each bidder receives the items 
their surrogate is awarded and pays the price the surrogate pays in M', as well as a little extra 
in order to buy the surrogate in the per-bidder VCG auction. The truthfulness of the two-stage 
mechanism M boils down to the truthfulness of VCG: If we can exactly evaluate the value of bidder 
i of type Vi for being represented by each surrogate Sj, and we use these values in computing the 
VCG allocation in the per-bidder VCG auction, then M is BIG Moreover, it is shown in [15] that 
the distribution of surrogates that play M' is exactly C So, this implies that, if we know exactly 
the reduced form of M' with respect to C, we can exactly compute the value of the bidder for each 
surrogate, and M will be BIG 

If we only know the reduced form of M' with respect to C within e in ^-distance, then we 
cannot exactly evaluate the value of bidder Hi for being represented by surrogate s*j, but we can 
evaluate it within e. Suppose that we run per-bidder VCG auctions using our estimates. In the 
VCG auction corresponding to bidder i, suppose Vi is the true type of the bidder, let tx\ be our 
estimate of the reduced form of the surrogate that was sold to the bidder, let p\ be the price that 
surrogate pays in M', and let q\ denote the price paid for that surrogate in VCG. Let (tt2,P2,c[2) 
be the corresponding triplet for another surrogate that the bidder would win by misreporting her 
type. By the truthfulness of VCG, 

v i -7r 1 -p 1 -q 1 >Vi-Tr 2 -p2- <?2- 

The question is how much the bidder regrets not misreporting to the VCG auction given that the 
true reduced form of surrogate Si in M' is some tt^, which was false-advertised in the VCG auction 
as 7Tj, i = 1, 2. Given that |vf^ — vr.j|i < e and remembering that each G [0, l] n we get: 

Vi ■ TT i > Vi ■ TT i - e 
Vi ■ 7T 2 > Vi ■ TT 2 ~ e - 

Therefore, 

Vi-Tr' 1 -p 1 - q 1 >v i -TT 2 -p2- q2~ 2e. 

This means that if bidder i were to misreport her type to get surrogate s*2, her true utility would 
increase by at most 2e. So M is 2e-BIC. 

Finally, we can use the same argument as in [15] to show that the reduction loses at most 
0(C(V& + y/j)) in revenue. □ 

Coming back to the proof of Theorem El if we just discretized the value distribution without running 
the approximately-BIC to BIC reduction of Lemma [L2| we would get a mechanism that is (e + 5)- 
BIC and suboptimal by (e + 5)C (Lemma 3 of [15] pins down the loss of truthfulness/revenue due 
to discretization of the value distribution in multiples of 5, and Theorem [7] bounds the additional 
loss due to computational constraints). If we apply the reduction of Lemma [12] on this mechanism, 
we can turn it into one that is e-BIC and suboptimal by 0(y/e + \^6)C. The reason we even bother 
running the approximately-BIC to BIC reduction when we don't get a truly BIC mechanism in the 
end is because, while keeping our algorithm efficient, e can be made as small as l/poly(n, m), while 
5 needs to stay a fixed constant. So after our reduction we obtained a qualitatively stronger result, 
namely one whose distance from truthfulness can be made arbitrarily small in polynomial time. □ 

H Accommodating Budget Constraints 

In this section we show a simple modification to our solutions that allows them to accommodate 
budget constraints as well. To do this, we simply cite an observation from [15]. There, it is observed 
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that if the solution concept is interim individual rationality, then the LP that finds the revenue- 
optimal reduced form can be trivially modified to accommodate budget constraints. In Figure [3] 
we show how to modify our LP from Figure [2] to accommodate budget constraints. 



Variables: 

• Pi{vi), for all bidders i and types V{ £ Ti, denoting the expected price paid by bidder i when 
reporting type Vi over the randomness of the mechanism and the other bidders' types. 

• Kij(vi), for all bidders i, items j, and types Vi E Tj, denoting the probability that bidder i 
receives item j when reporting type v*i over the randomness of the mechanism and the other 
bidders' types. 

Constraints: 

• Tfi(vi) ■ Vi — Pi(vi) > TTi(wi) -Vi —pi(wi) — 5, for all bidders i, and types Vi, Wi £ Ti, guaranteeing 
that the reduced form mechanism (if,p) is 5-BIC. 

• TTi(vi) • Vi — Pi(vi) > 0, for all bidders i, and types Vi £ Ti, guaranteeing that the reduced form 
mechanism (n,p) is individually rational. 

• SO(if) ="yes," guaranteeing that the reduced form if is in F(F,V). 

• Pi{vi) < Bi, for all bidders i and types V{ G Ti, guaranteeing that no bidder i pays more 
than their budget Bi. 

Maximizing: 

• YaLi Ylvi&Ti P r [^j = Vi] ' Pi(vi)i t ne expected revenue when played by bidders sampled from 
the true distribution V. 



Figure 3: A linear programming formulation for MDMDP that accommodates budget constraints. 



It is also shown in pjy that accommodating budget constraints comes at a cost. First, it is 
shown that without budget constraints, one can turn any interim-IR mechanism into an ex-post 
IR mechanism with no loss in revenue. However, with budget constraints, there is a potentially 
large gap between the revenue of the optimal ex-post IR mechanism and the optimal interim IR 
mechanism. In other words, accommodating budget constraints requires accepting interim IR 
instead of ex-post IR. Second, the approximately-BIC to BIC reduction of [15] (used in the proof 
of Theorem [8|) does not respect budget constraints. So to accommodate budgets in Theorem [8] 
the output mechanism needs to be <5-BIC instead of e-BIC0 Relative to the ability to naturally 
accommodate budget constraints, these costs are minor, but we state them in order to correctly 
quantify the settings our techniques solve. 
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